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Abstract 

Experimental  psychology,  cognitive  science,  and  human-factors  engineering  have  progressed 
sufficiently  far  that  a  practical  unified  theory  of  cognition  and  action  is  now  foreseeable.  Such  a 
theory  soon  may  yield  useful  quantitative  predictions  about  rapid  human  multiple-task  performance 
in  applied  settings.  Toward  this  end,  an  Executive-Process/Interactive-Control  (EPIC)  architecture 
has  been  formulated  with  components  whose  assumed  properties  emulate  fundamental  perceptual, 
cognitive,  and  motor  processes.  On  the  basis  of  EPIC,  a  theorist  may  construct  detailed 
computational  models  that  characterize  multiple-task  performance  under  both  laboratory  and  real- 
world  conditions.  For  example,  EPIC  computational  models  provide  good  accounts  of  response 
latencies  and  accuracies  from  the  psychological  refractory-period  procedure,  aircraft  cockpit 
operation,  and  human-computer  interaction.  As  a  result,  major  commonalities  in  performance  across 
various  task  domains  have  been  discovered,  and  efficacious  principles  for  designing  person-machine 
interfaces  have  been  identified.  The  substantive  and  methodological  lessons  learned  from  these 
advances  constitute  an  instructive  precis  to  further  utilitarian  theoretical  unification. 


*  This  document  is  a  preprint  of  a  chapter  to  be  published  in  the  book  Attention  and  Performance  XVII  (Gopher  & 
Koriat,  in  press).  The  chapter  is  based  on  an  invited  lecture  presented  at  the  Seventeenth  International  Symposium  on 
Attention  and  Performance  in  Haifa,  Israel,  during  July,  1996.  We  thank  Daniel  Gopher  and  Asher  Koriat  for  organizing 
the  symposium.  Funding  to  prepare  the  chapter  was  provided  by  the  United  States  Office  of  Naval  Research  (ONR) 
through  grant  N00014-92-J-i  173  to  the  University  of  Michigan,  David  E.  Kieras  and  David  E.  Meyer,  Principal 
Investigators.  We  thank  Terry  Allard,  Susan  Chipman,  Harold  Hawkins,  and  Willard  Vaughan  of  the  ONR  for  their 
continuing  encouragement  and  support.  Additional  funding  was  provided  by  the  research  laboratory  of  NYNEX. 
Research  assistance  and  scientific  contributions  by  past  and  present  members  of  the  Brain,  Cognition,  and  Action 
Laboratory  at  the  University  of  Michigan  (Dana  Apfelblat,  David  Fencsik,  Jennifer  Glass,  Leon  Gmeindl,  Cerita  Jones, 
Ryan  Ketter,  Erick  Lauber,  Shane  Mueller,  Eric  Schumacher,  Molly  Schweppe,  Travis  Seymour,  B.  J.  Woodside,  and 
Eileen  Zurbriggen)  are  gratefully  acknowledged.  Comments  by  Eric  Schumacher  on  an  initial  draft  of  this  chapter  are 
appreciated  as  well.  Correspondence  may  be  addressed  to  David  E.  Meyer,  Cognition  and  Perception  Program,  Dept,  of 
Psychology,  University  of  Michigan,  525  E.  University,  Ann  Arbor,  MI,  48109-1 109,  USA.  Electronic  mail  may  be  sent 
to  demeyer@umich.edu. 
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Introduction 

Like  the  quest  of  Indiana  Jones,  the  adventurous  anthropologist  in  Raiders  of  The  Lost  Ark 
(Kasdan,  Lucas,  &  Kaufman,  1981),  our  journey  to  the  Holy  Land  fox  Attention  and  Performance 
XVII  has  brought  us  in  search  of  an  alluring  mystical  treasure.  The  treasure  that  we  seek  is  a  unified 
theory  of  cognition  and  action  through  which  human  performance  can  be  understood  and  predicted 
in  a  variety  of  contexts,  spanning  elementary  laboratory  paradigms  and  complex  real-world 
situations.  Although  not  so  sacred  as  the  Lost  Ark  of  the  Covenant,  such  a  theory  would  have  great 
value  for  both  applied  psychological  science  and  the  present  book.  By  design,  this  book  concerns 
the  cognitive  reflation  of  human  performance,  with  special  emphasis  on  interactions  between 
theory  and  practical  applications.  Ultimately,  these  interactions  and  future  progress  from  them  will 
grow  best  if  researchers  succeed  at  constmcting  a  coherent  conceptual  framework  in  which  scientific 
knowledge  is  synthesized  about  several  complementary  topics,  including  goal-directed  behavior, 
top-down  supervisory  executive  processes,  performance  strategies,  attentional  control  mechanisms, 
and  conscious  appraisal  of  the  world.  As  one  kindred  seeker  enthusiastically  proclaimed:  "There's 
nothing  so  useful  as  a  good  theory"  (Gopher,  1996).' 

Moving  onward  with  such  enthusiasm,  the  remainder  of  this  chapter  is  organized  as  follows. 

We  start  by  reviewing  some  past  history  that  has  paved  the  way  for  our  efforts  toward  formulating  a 
unified  theoiy  of  cognition  and  action.  On  the  basis  of  these  prior  developments,  a  functional 
architecture  is  introduced  for  emulating  fundamental  characteristics  of  the  human  information¬ 
processing  system.  Next,  using  this  architecture,  illustrative  computational  models  are  constructed 
to  account  for  observed  aspects  of  multiple-task  performance  in  typical  laboratory  paradigms  and 
real-world  situations.  From  these  accounts,  some  instructive  lessons  are  derived  with  respect  to  how 
further  attempts  at  theoretical  unification  should  proceed.  Although  the  chapter  does  not  end  with  a 
complete  veridical  unified  theory,  it  may  provide  some  intermediate  guides  along  the  way  to  this 
objective.  Additional  guidelines  of  this  sort  appear  in  several  other  related  publications  (e.g.,  Kieras 
&  Meyer,  1995,  1997;  Kieras,  Wood,  &  Meyer,  1997;  Meyer  &  Kieras,  1994, 1997a,  1997b;  Meyer, 
Kieras,  Lauber,  Schumacher,  Glass,  Zurbriggen,  Gmeindl,  &  Apfelblat,  1995). 


Historical  Background 

Our  efforts  toward  formulating  a  unified  theory  of  cognition  and  action  have  been  inspired  by 
several  prescient  prophets  of  experimental  psychology,  cognitive  science,  and  human-factors 
engineering.  Among  them,  an  especially  noteworthy  sage  was  the  late  Allen  Newell,  who,  nearly  a 
quarter  century  before  Attention  and  Performance  XVII,  published  an  influential  chapter  entitled 
"You  can't  play  20  questions  with  nature  and  win"  (Newell,  1973a).  Newell's  thesis  was  that 
experimental  psychology  has  neglected  to  take  sufficient  stock  of  the  big  theoretical  picture  toward 
which  it  should  contribute,  thereby  hindering  cumulative  scientific  progress  in  the  study  of  mind  and 
behavior.  Rather,  as  Newell  saw  it,  too  much  effort  has  been  expended  on  conducting  narrow 
empirical  studies  to  test  seductively  simple  binary  hypotheses  (e.g.,  early  versus  late  attentional 
selection,  serial  versus  parallel  memory  search,  and  imaginal  versus  propositional  knowledge). 

These  studies  have  accumulated  an  impressively  large  collection  of  basic  facts,  on  the  order  of  3000 
"good  quantitative  regularities"  (e.g.,  see  Atkinson,  Hernstein,  Lindzey,  &  Luce,  1988;  Boff, 
Kaufman,  &  Thomas,  1986;  Meyer  &  Kornblum,  1993),  but  how  they  all  fit  together  theoretically 
remains  a  Great  Psychology  Data  Puzzle  (Newell,  1992).2  There  are  still  no  general  theories  that 


'  The  original  form  of  this  proclamation  is  attributable  to  Lewin  (1951),  who  said  "there's  nothing  so  practical  as  a  good 
theory." 

2  For  example,  one  impressive  illustration  of  the  abundant  quantitative  regularities  now  available  from  experimental 
psychology  has  been  provided  by  Salthouse  (1986).  He  tabulated  29  systematic  phenomena  discovered  about 
transcription  typing.  Presumably  they  stem  from  interactive  properties  of  skilled  typists’  perception,  attention,  memory, 
and  motor  control  processes.  Yet  the  exact  nature  of  these  interactions  and  the  processes  that  mediate  them  remains  to 
be  ellucidated  on  the  basis  of  a  theory  that  also  accounts  for  other  types  of  regularity  (cf.  John,  1988;  Rumelhart  & 
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have  adequate  practical  utility  across  many  domains  of  application.  Instead,  what  psychological 
theorizing  has  produced  thus  far  is  mostly  just  a  set  of  unrelated  micromodels  that  are  relevant  only 
to  separate  small  families  of  empirical  phenomena  in  limited  artificial  contexts. 

Characteristics  of  Unified  Theories 

To  go  beyond  these  confines,  Newell  (1990, 1992)  and  his  colleagues  (e.g..  Card,  Moran,  & 
Newell,  1983;  Laird,  Newell,  &  Rosenbloom,  1987)  have  advocated  the  development  of  unified 
theories  of  cognition  (UTCs).  An  ideal  UTC  would  postulate  "a  single  system  of  mechanisms  that 
operate  together  to  produce  the  full  range  of  human  cognition"  (Newell,  1990,  p.  1).  The  motivation 
for  this  approach  was  summarized  in  Newell's  original  chapter  on  the  futility  of  simply  playing  20 
questions  with  nature: 

"Our  task  in  psychology  is  first  to  discover  the  invariant  structure  of  processing  mechanisms.... 
Without  such  a  framework  in  which  to  work,  the  generation  ...  of  new  explanations  for  old 
phenomena  will  go  on  ad  nauseum"  (Newell,  1973a,  pp.  293,  296). 

As  envisioned  by  Newell,  the  systems  on  which  UTCs  are  based  conceptually  should  have 
detailed  information-processing  architectures  with  set  interconnected  components  that  implement 
elementary  symbolic  computational  processes  for  perception,  cognition,  and  action.  A  complete 
UTCs  architecture  ought  to  be  sufficiently  powerful  that  programs  executed  by  it  can  accurately 
simulate  covert  mental  processes  and  overt  behavior  associated  with  learning,  memory,  perceptual- 
motor  skills,  language  comprehension,  decision  making,  problem  solving,  and  other  complex 
functions.  In  Newell's  (1990,  1992)  opinion,  more  than  enough  empirical  data  are  available  now  for 
starting  to  formulate  UTCs,  so  experimental  psychology  and  cognitive  science  must  make  theoretical 
unification  be  an  immediate  principal  goal.  Some  prominent  contributors  to  Attention  and 
Performance  symposia  have  called  as  well  for  such  unification: 

"What  is  urgently  needed  is  ...  a  computational  theory,  in  the  sense  outlined  by  Marr  (1982),  of 
the  many  different  functions  of  attentional  selectivity  and  control ...  taking  seriously  the  idea 
that  attentional  functions  are  of  many  different  kinds,  serving  a  great  range  of  different 
computational  purposes"  (Allport,  1993,  pp.  205-206). 

"We  need  computational  theories  of  interaction  between  stages.  As  the  number  of  theoretical 
entities  increases  in  each  area,  it  becomes  increasingly  hard  to  see  the  implications  of  combining 
them.  Only  computational  systems  can  do  this,  and  they  will  have  the  merit  of  stopping  the 
laxness  of  definition  noted  by  Allport"  (Broadbent,  1993,  p.  876). 

Indeed,  UTCs  may  provide  numerous  complementary  benefits,  which  include  amortization  of 
theoretical  constructs,  integration  of  multiple  empirical  constraints,  maximization  of  process 
identifiability,  solution  of  irrelevant-specification  problems,  absolution  from  Popperian  damnation, 
amplification  of  scientific  progress,  and  promotion  of  practical  applications  (Newell,  1990,  p.  18). 
By  combining  extant  theoretical  constracts  from  diverse  sources  into  one  integrated  framework,  a 
UTC  repays  debts  owed  to  past  sponsors  of  experimental  psychology.  The  prospective  payoff  is 
enhanced  because  UTCs  account  for  a  wide  variety  of  data  whose  overall  pattern  imposes  multiple 
empirical  constraints  on  functional  properties  that  a  theory's  mechanisms  must  have.  In  essence,  this 
maximizes  process  identifiability  and  helps  clarify  what  canonical  assumptions  are  most  appropriate. 
Given  maximum  process  identifiability,  the  theorist  can  forego  appending  "Rube  Goldberg  kludges" 
as  part  of  the  system,  thereby  solving  irrelevant-specification  problems  (i.e.,  haphazard  postulation 
of  arbitrary  components  that  contribute  to  explanations  in  an  unprincipled  fashion). 


Norman,  1982).  If  there  are  on  the  order  of  30  known  regularities  relevant  to  transcription  typing,  a  rather  delimited 
performance  domain,  then  it  is  easy  to  imagine  that  research  on  human  performance  already  has  found  3000  or  more 
regularities  overall. 
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As  a  result,  there  may  come  absolution  from  Popperian  damnation,  that  is,  forgiveness  for 
having  proposed  simple  binary  hypotheses  with  which  aberant  bits  of  data  are  inconsistent  (cf. 
Popper,  1959).  Such  absolution  can  facilitate  scientific  progress;  no  longer  will  it  be  necessary  to 
formulate,  test,  reject,  and  reformulate  simple  theoretical  alternatives  repeatedly  in  an  endless 
Karmic  birth-death  cycle  of  the  same  old  hypotheses.^  Through  the  progress  that  UTCs  enable,  they 
can  be  taken  into  the  field  for  useful  practical  applications.  "A  unified  theory  of  cognition  is  the  key 
to  successful  applied  cognitive  science"  (Newell,  1990,  p.  498).  This  follows  because  important 
real-world  tasks  engage  many  aspects  of  cognition,  and  effective  behavior  under  such  circumstances 
depends  on  interactions  among  many  information-processing  components,  so  successful  applications 
must  treat  all  of  these  components  in  an  integrated  fashion.^ 

Furthermore,  the  connection  between  application  and  theoretical  unification  should  be  taken  as 
a  two-way  street.  While  UTCs  lead  to  useful  applications,  serious  concern  about  practical 
applications  can  foster  substantial  theoretical  development  and  unification.  Again  in  Newell's 
words: 

"Applications  provide  crucial  ingredients  for  the  overall  basic  scientific  enterprise.  [They]  are 
critical  for  the  internal  conduct  of  the  science.  They  establish  what  is  worth  predicting.  They 
establish  what  accuracy  is  sufficient.  They  establish  when  a  regularity  is  worth  remembering. 
They  establish  when  a  theory  should  not  be  discarded....  Applications  have  a  wisdom  that  the 
current  fashions  of  theory  do  not"  (Newell,  1990,  pp.  501-502)". 

Mindful  of  these  words  and  also  those  spoken  by  John  F.  Kennedy,  we  therefore  should  ask  not  just 
what  unified  theories  of  cognition  can  do  for  applications,  but  also  what  applications  can  do  for 
cognitive  theories  (Newell,  1990,  p.  500).^ 

Harbingers  of  Unified  Theories 

Since  Newell's  (1973a)  original  manifesto,  no  complete  veridical  UTC  has  been  developed  yet. 
Nevertheless,  some  promising  harbingers  of  theoretical  unification  have  appeared  on  the  scene  (for  a 
comparative  review  and  evaluation,  see  Newell,  1990,  pp.  23-36).  These  include  the  Model  Human 
Processor  of  Card  et  al.  (1983),  the  ACT*  system  of  J.  R.  Anderson  (1983),  and  the  SOAR  system  of 
Newell  (1990,  1992)  and  his  colleagues  (Laird  et  al.,  1987).® 


^  Newell's  (1973a,  1990, 1992)  teachings  have  many  interesting  parallels  with  those  of  the  Buddha.  For  example, 
according  to  the  Buddha,  the  way  to  escape  perpetual  mental  suffering  is  to  contemplate  and  unify  various  basic  inner 
psychological  and  outer  environmental  realms  (Bukkyo  Dendo  Kyokai,  1985).  Similarly,  according  to  Newell,  release 
from  intellectual  turmoil  in  experimental  psychology  and  cognitive  science  will  come  through  deep  theoretical 
contemplation  and  unification. 

As  the  above  discussion  implies,  applied  cognitive  science  must  find  a  way  to  sail  successfully  between  what  Sanders 
(1991)  called  the  "Scylla"  of  experimental  psychology  and  the  "Charybdis"  of  human-factors  engineering.  In  his  view, 
experimental  psychology  has  formed  a  massive  Scylla  of  relatively  simple  laboratory  phenomena  that  are  easy  to 
measure  but  that  also  are  separate  from  the  "richness  of  reality",  whereas  human-factors  engineering  swirls  with  the 
treacherous  Charybdis  of  complex  phenomena  that  are  difficult  to  analyze  but  that  also  are  typical  of  real  life  (Sanders, 
1991,  p.  997).  The  development  of  UTCs  can  help  us  chart  a  course  through  the  gap  that  separates  these  two  perils. 

®  For  example,  consider  circumstances  that  involve  Fitts'  (1954)  law,  under  which  the  mean  duration  of  rapid  aimed 
movements  to  a  target  region  is  a  logarithmic  function  of  the  target  distance  divided  by  the  target  width.  In  numerous 
applied  settings  where  aimed  movements  must  be  made,  Fitts'  law  prevails  (Meyer,  Abrams,  Komblum,  Wright,  & 
Smith,  1988).  This  prevalence  suggests  that  Fitts'  law  is  a  fundamental  regularity  whose  underlying  mechanisms  should 
play  a  role  as  part  of  any  veridical  UTC  (cf.  Newell,  1990,  pp.  3-6). 

®  There  are  also  other  major  harbingers  of  UTCs  (e.g.,  Just  &  Carpenter,  1987;  Norman  &  Shallice,  1986;  Schneider  & 
Detweiler,  1987).  More  consideration  of  these  may  be  found  elsewhere  (Meyer  &  Kieras,  1997a;  Newell,  1990). 
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Model  Human  Processor.  The  Model  Human  Processor  (MHP)  was  developed  specifically  for 
applications  to  human-computer  interaction  (HCI).  To  predict  the  speed  and  accuracy  of  people's 
performance  during  HCI  tasks  such  as  text  editing,  Card  et  al.  (1983)  endowed  the  MHP  with  a 
combination  of  general-purpose  memory  stores  and  processing  units  whose  functional  characteristics 
approximated  those  of  the  human  information-processing  system.  Among  the  MHP’s  memory  stores 
are  a  short-term  working  memory  and  a  long-term  declarative  memory.  They  involve  putative 
information  codes,  storage  capacities,  and  durations  consistent  with  generic  data  available  at  the  time 
of  the  MHP’s  formulation.  Complementing  these  memory  stores,  among  the  MHP's  processing  units 
are  a  perceptual  processor,  cognitive  processor,  and  motor  processor.  The  assumed  durations  of 
their  operations  have  magnitudes  consistent  with  previously  estimated  times  taken  by  stages  of 
processing  such  as  stimulus  encoding,  response  selection,  and  movement  production  (cf.  Sanders, 
1980;  Sternberg,  1969).  From  integrating  the  processing  units  and  memory  stores  of  the  MHP  in  a 
"boxes  and  arrows"  flowchart,  acceptably  accurate  "engineering  approximations"  of  response  speed 
and  accuracy  across  a  variety  of  HCI  tasks  were  obtained  by  Card  et  al.  (1983).^  At  the  time,  their 
success  demonstrated  that  substantial  data  from  numerous  areas  of  experimental  psychology  indeed 
are  available  for  taking  significant  strides  toward  utilitarian  theoretical  unification. 

Nevertheless,  the  scope  of  the  MHP  is  seriously  limited.  For  example,  its  perceptual  and  motor 
processors  do  not  embody  all  important  characteristics  of  information  processing  in  the  visual, 
auditory,  tactile,  ocular,  articulatory,  and  manual  modalities.  Also,  the  cognitive  processor  and 
working  memory  of  the  MHP  do  not  enable  computer  simulations  of  human  performance  with 
observable  outputs  under  diverse  task  conditions;  that  is.  Card  et  al.  (1983)  did  not  go  much  beyond 
the  boxes-and-arrows  phase  of  theory  development.  As  a  result,  the  MHP  lacks  crucial  features  that 
a  complete  veridical  UTC  should  have. 

ACT*.  In  some  respects,  the  ACT*  system  (J.  R.  Anderson,  1983),  which  evolved  from  J.  R. 
Anderson  and  Bower's  (1973)  Human  Associative  Memory  (HAM)  model  and  J.  R.  Anderson's 
(1976)  ACT  system,  is  more  complete  and  precise  than  Card  et  al.'s  (1983)  MHP.  For  ACT*,  J.  R. 
Anderson  (1983)  distinguished  explicitly  between  procedural  and  declarative  knowledge.  Pursuing 
this  distinction,  he  embodied  procedural  knowledge  in  ACT*  as  a  formal  production-system  under 
which  various  tasks  could  be  performed  through  sets  of  production  (if  condition,  then  action)  rules. 
The  development  of  ACT*  honored  Newell’s  (1973b)  prior  suggestion  that  production  systems 
would  help  construct  more  complete  models  of  human  information  processing.  ACT*’s  production 
system  has  a  rule  interpreter  with  well-defined  properties  (e.g.,  conflict-resolution  criteria).  Under 
this  interpreter,  the  conditions  of  specified  production  rules  are  compared  with  the  current  contents 
of  working  memory,  and  the  rules'  actions  are  executed  contingent  on  the  outcomes  (matches  or 
mismatches)  of  these  comparisons. 

Assumed  details  of  the  ACT*  rule  interpreter,  working  memory,  and  other  ancill^  components 
enabled  J.  R.  Anderson  (1983)  to  account  for  RT  and  accuracy  data  from  comprehension  and 
reasoning  tasks.  Furthermore,  with  algorithms  for  compiling  and  tuning  procedural  knowledge, 
phenomena  of  cognitive-skill  acquisition  (e.g.,  the  power  law  of  practice;  Fitts,  1964)  are  explained 
by  ACT*.*  So  ACT*  has  more  inherent  potential  than  does  the  MHP  to  enable  computer 
simulations  and  to  become  a  bona  fide  UTC. 

Yet  ACT*  also  has  significant  limitations.  Unlike  in  the  MHP,  no  serious  treatment  of  ocular, 
manual,  and  articulatory  motor  control  is  included  thus  far  as  part  of  the  ACT*  architecture.  Nor 
have  initial  applications  of  ACT*  dealt  extensively  with  complex  problem  solving  of  the  sort 
addressed  previously  by  Newell  and  Simon's  (1972)  General  Problem  Solver  (GPS).  Instead,  J.  R. 


^  By  informal  convention,  an  acceptably  accurate  engineering  approximation  is  one  such  that  for  an  empirical  data  set, 
the  values  predicted  or  postdicted  on  the  basis  of  theoretical  calculations  deviate  from  the  observed  values  by  no  more 
than  10%  of  the  observed  values'  magnitudes  (Card  et  al.,  1983;  Newell,  1990). 

*  According  to  the  power  law  of  practice,  the  reaction  time  (RT)  to  complete  one  trial  of  a  task  is  a  power  function  whose 
domain  is  the  number  (AT)  of  prior  practice  trials  on  the  task  and  whose  exponent  is  a  negative  constant.  As  a  result, 
log  RT  would  be  a  negatively  sloped  linear  function  of  log  N  (J.  R.  Anderson,  1983;  Fitts,  1964;  Newell,  1990). 
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Anderson  (1983)  focussed  mainly  on  intermediate  processes  of  learning,  memory,  language 
comprehension,  and  inference. 

SOAR.  The  gaps  left  by  ACT*  and  the  MHP  lead  us  to  the  SOAR  system  of  Newell  (1990, 
1992)  and  his  colleagues  (e.g.,  Laird  et  al.,  1987).  Rising  on  the  winds  of  change,  SOAR  surpassed 
these  prior  harbingers  of  a  UTC  by  incorporating  more  detailed  assumptions  about  perceptual-motor 
and  attentional  processes  in  the  context  of  a  production-system  architecture.  Also,  extending  the 
approach  taken  before  in  GPS  (Newell  &  Simon,  1972),  an  explicit  characterization  of  complex 
impasse-driven  problem  solving  has  been  embodied  in  SOAR's  repertoire  of  cognitive  mechanisms. 
Through  an  opportunistic  "chunking"  algorithm,  impasses  that  arise  during  novel  task  performance 
are  resolved  by  SOAR  via  a  heuristic  search  of  problem  spaces.  The  "chunking"  algorithm  yields 
new  procedural  operators  that  enable  automatized  perceptual-motor  and  cognitive  skill  (cf.  J.  R. 
Anderson,  1983;  Schneider  &  Detweiler,  1988;  Shiffrin  &  Schneider,  1977).  As  a  result, 
phenomena  at  various  levels  of  complexity,  including  automatic  active-memory  search,  the  power 
law  of  practice,  and  means-ends  problem  solving  are  treated  within  SOAR's  purview. 

Nevertheless,  there  is  still  more  to  the  story  of  human  performance  than  SOAR  faithfully 
accommodates.  SOAR’s  components  for  implementing  simulations  of  ocular,  manual,  and 
articulatory  motor  control  remain  less  well  developed  than  would  be  desirable.  Given  this  lack  of 
development,  interactions  between  perception  and  motor  control  are  not  yet  characterized 
sufficiently  in  SOAR.  Nor  does  SOAR  have  much  to  say  thus  far  about  multiple-task  performance. 
What  executive  processes  allocate  limited  perceptual-motor  and  cognitive  resources  for  scheduling 
two  or  more  concurrent  tasks  while  satisfying  task  priorities  imposed  by  prevailing  environmental 
constraints?  How  do  executive  processes  and  skilled  performance  based  on  them  evolve  through 
transformations  of  declarative  to  procedural  knowledge?  Why  might  individual  performers  differ 
systematically  with  respect  to  the  types  of  executive  process  and  degrees  of  task  coordination  that 
they  achieve?  Many  such  open  questions  remain  to  be  answered  as  part  of  formulating  a  future 
unified  theory  of  cognition  and  action. 


Present  Objectives 

The  objectives  of  the  present  chapter  are  to  foster  further  theoretical  unification  in  the  scientific 
study  of  human  performance  and,  concomitantly,  to  help  answer  some  of  the  open  questions 
mentioned  previously.  To  do  so,  we  subsequently  introduce  a  functional  architecture  for  emulating 
basic  characteristics  of  human  information  processing.  Our  architecture  is  called  EPIC,  which 
stands  for  Executive-Process  Interactive  Control.  Although  EPIC  does  not  yet  constitute  a  complete 
veridical  UTC,  it  supplements  prior  theories  in  some  significant  ways.  EPIC  enables  not  only 
procedural  cognition  but  also  motor  control  and  perceptual-motor  interactions  to  be  treated  explicitly 
and  parsimoniously  in  conjunction  with  formal  hypotheses  about  supervisory  executive  cognitive 
processes  and  task-scheduling  strategies.  Given  such  treatment,  precise  computational  models  can 
be  constructed  to  explain  and  predict  reaction  times  (RTs),  response  accuracy,  and  other 
measureable  aspects  of  people's  overt  behavior  across  various  domains  where  multiple  tasks  must  be 
performed  concurrently.  The  domains  to  which  our  EPIC  computational  models  are  applicable 
include  both  elementap'  laboratory  paradigms  and  complex  real-world  situations.  As  outlined  later, 
results  from  such  applications  yield  instructive  lessons  that  should  be  taken  into  consideration  during 
future  work  toward  theoretical  unification  in  studies  of  human  performance.  The  transfer  of  data  and 
theory  from  laboratory  to  real  world  may  proceed  more  quickly  by  taking  these  lessons  seriously. 

Relevance  of  Multiple-Task  Performance 

Multiple-task  performance  under  speed  stress,  a  traditional  topic  of  human-performance  theory 
(Meyer  &  Komblum,  1993),  offers  an  especially  relevant  venue  for  developing  UTCs.  In  accord 
with  Newell's  (1990,  1992)  terminology,  this  topic  involves  studying  immediate  behavior,  that  is, 
responses  to  stimuli  during  brief  tasks  whose  performance  yields  reaction  times  on  the  order  of 
100  ms  <  RT  <  1000  ms.  Along  the  overall  time  scale  of  human  action,  the  mental  processes  that 
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mediate  such  responding  fall  in  the  lower  part  of  what  Newell  has  called  the  cognitive  band.^  Here 
fundamental  symbolic  computations  are  accomplished  to  access  information  in  various  memory 
stores,  make  elementary  decisions,  store  intermediate  products  temporarily,  and  execute  input-output 
transformations. 

Centrality  of  the  cognitive  band.  Newell  (1990, 1992)  has  argued  that  the  types  of  computation 
done  in  the  cognitive  band  are  fundamental  to  all  intelligent  information  processing.  From  the 
perspective  of  this  argument,  RTs  for  immediate  behavior  manifest  the  durations  of  these 
computations  directly,  revealing  the  nature  of  the  functional  architecture  that  implements  them.  Any 
hypothesized  architecture  must  be  consistent  with  available  RT  data,  which  impose  a  strong  real¬ 
time  constraint  on  UTCs.  By  taking  such  data  thoroughly  into  account,  as  focussing  on  rapid 
multiple-task  performance  leads  us  to  do,  we  may  arrive  more  quickly  at  detailed  specifications  for  a 
complete  veridical  UTC.  Thus,  during  the  initial  development  of  SOAR's  architecture,  Newell 
(1990,  1992)  and  his  colleagues  (e.g.,  John,  1988;  John  &  Newell,  1987,  1989;  John,  Rosenbloom,  & 
Newell,  1985)  made  concerted  efforts  to  ensure  that  it  was  consistent  with  known  facts  about 
stimulus-response  compatibility  (cf.  Duncan,  1977;  Fitts  &  Seeger,  1953;  Komblum,  Hasbroucq,  & 
Osman,  1990),  active-memory  scanning  (cf.  Sternberg,  1969),  and  transcription  typing  (cf. 

Salthouse,  1986),  which  typify  systematic  empirical  phenomena  related  to  immediate  behavior. 
Similar  attempts  have  been  made  by  other  progenitors  of  UTCs  (e.g.,  J.  R.  Anderson,  1983,  1990, 
1993),  even  though  they  do  not  go  as  far  as  EPIC  has  on  this  score. 

Availability  of  RT  data.  In  light  of  these  considerations,  students  of  rapid  human  performance 
are  well  simated  for  contributing  to  the  further  development  of  UTCs.  Through  techniques  of  mental 
chronometiy  (Donders,  1868/1969;  Luce,  1986;  Meyer,  Osman,  Irwin,  &  Yantis,  1988;  Miller,  1988; 
Roberts,  &  Sternberg,  1993;  Sanders,  1980;  Sternberg,  1969;  Woodworth,  &  Schlosberg,  1954), 
large  amounts  of  RT  data  that  are  relevant  to  immediate  behavior  have  been  collected.  Much  of  the 
available  data  bear  on  selective  attention  (e.g.,  Eriksen,  &  Yeh,  1985;  Jonides,  1980;  Jonides,  & 
Yantis,  1986;  Posner,  1980;  Treisman,  1988;  Yantis  &  Jonides,  1986),  motor  control  (e.g.,  Abrams 
&  Jonides,  1988;  Fischer  &  Ramsberger,  1984;  Ghez,  Hening,  &  Favilla,  1990;  Meyer  &  Gordon, 
1985;  Reuter-Lorenz,  Hughes,  &  Fendrich,  1991;  Rosenbaum,  1980;  Sternberg,  Monsell,  Knoll,  & 
Wright,  1978),  perceptual-motor  interaction  (e.g.,  Rosenbaum,  1991),  and  coordination  of 
information-processing  operations  in  multiple-task  performance  (e.g.,  Damos,  1991;  Gopher  & 
Donchin,  1986;  Meyer  &  Kieras,  1997a,  1997b).  This  "grist  for  the  mill"  is  exactly  what  we  need  to 
specify  correctly  the  details  of  prospective  UTCs'  functional  architectures. 

Importance  of  practical  needs.  With  respect  to  meeting  important  practical  needs,  the  study  of 
human  performance  could  use  more  theoretical  unification.  Multiple  tasks  must  be  performed 
rapidly  and  accurately  in  many  important  real-world  situations  such  as  HCI,  aircraft  cockpit 
operation,  air-traffic  control,  automobile  cellular-phone  communication,  power-plant  supervision. 


^  The  lower  part  of  the  cognitive  band  includes  two  sublevels  of  information  processing:  deliberate  acts  with  durations 
on  the  order  of  100  ms,  and  simple  operations  (short  sequences  of  deliberate  acts  used  to  perform  simple  tasks)  with  total 
durations  on  the  order  of  1  s  (Newell,  1990).  For  example,  under  our  EPIC  computational  models,  "firing"  a  production 
rule  during  a  typical  choice-reaction  task  constitutes  a  deliberate  act,  and  using  a  sequence  of  such  rules  to  perform  the 
whole  task  constitutes  a  simple  operation.  The  part  of  the  cognitive  band  directly  above  these  two  sublevels  consists  of 
composed  operations  (sequences  of  simple  operations  used  to  perform  complex  tasks)  with  total  durations  on  the  order 
of  10  s.  For  example,  skilled  playing  of  rapid-transit  chess  presumably  entails  composed  operations. 

*0  On  the  overall  time  scale  of  human  action,  there  are  also  other  activity  bands.  Below  the  cognitive  band  is  the 
biological  band,  a  physical  substrait  for  the  functional  architecture  of  people's  information-processing  system.  The 
biological  level  has  at  least  three  sublevels:  organelle,  neuron,  and  neural  circuit.  At  the  neural-circuit  sublevel, 
activities  take  on  the  order  of  10  ms  to  complete,  which  in  turn  yields  the  aforementioned  approximate  100  ms  durations 
of  deliberate  acts  (Newell,  1990;  cf  Footnote  9).  Connectionist  network  models  provide  abstract  characterizations  of  the 
neural-circuit  sublevel  (Rumelhart  &  McClelland,  1986).  Above  the  cognitive  band  are  the  rational  and  social  bands. 
The  rational  band  involves  complex  problem-solving  activities  with  durations  on  the  order  of  minutes  or  more.  SOAR  is 
especially  tailored  to  characterize  information  processing  in  the  rational  band.  Farther  up  scale,  the  social  band  involves 
the  long-term  pursuit  of  people's  life  goals.  No  prospective  UTC  yet  deals  seriously  with  the  social  band.  Instead,  on  the 
basis  of  Newell's  (1990)  analyses  and  arguments,  the  cognitive  band  is  most  important  for  present  purposes. 
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and  so  forth.  Advances  in  our  understanding  of  performance  under  such  circumstances  are  required 
so  that  the  design  of  person-machine  interfaces,  selection  of  personnel,  and  training  for  successful 
usage  may  be  facilitated.  This  facilitation  would  enhance  productivity  and  reduce  the  frequency  of 
disasters  such  as  those  involving  Three-Mile  Island  and  the  naval  cruiser  Vincennes. 

Inadequacy  of  current  frameworks.  Unfortunately,  the  general-purpose  theoretical  frameworks 
being  used  currently  in  applications  to  real-world  situations  where  multiple-task  performance  plays 
important  roles  are  less  than  fully  adequate.  These  frameworks  include  the  SAINT  (Chubb,  1981) 
and  HOS  (Lane,  Strieb,  Glenn,  &  Wherry,  1981;  Harris,  lavecchia,  Ross,  &  Shaffer,  1987)  modeling 
systems.  Although  constituting  valuable  assessment  tools,  they  do  not  enable  precise  computer 
simulations  of  complex  multiple-task  performance  nor  do  they  have  the  flexibility  and  generativity 
for  a  wide  variety  of  applications  (cf.  O'Donnell  &  Eggemeier,  1986;  Sanders,  1991;  Vreuls  & 
Obemayer,  1985).  Much  room  remains  for  significant  contributions  to  be  made  by  new  UTCs  in 
applied  settings.  Their  ultimate  benefits  may  grow  dramatically  as  future  technology  gives  human- 
factors  engineers  more  options  for  designing  efficient  user-friendly  person-machine  interfaces. 

Potential  Pitfalls 

Of  course,  one  must  be  vigilant  for  some  potential  pitfalls  along  the  way  toward  a  complete 
veridical  UTC;  developing  such  a  theory  will  not  be  easy. 

Necessity  of  selection.  In  particular,  there  is  the  necessity  of  selection  (Newell,  1990),  which 
poses  the  theorist  with  difficult  dilemmas.  Among  the  many  extant  theoretical  concepts  about 
human  performance,  some  must  be  included  and  others  excluded  from  any  particular  UTC,  even 
though  most  of  them  have  persuasive  advocates  and  potential  merit.  For  example,  should  a  UTCs 
perceptual  processors  be  equipped  with  precategorical  attentional  filters  (cf.  Allport,  1989; 
Broadbent,  1958;  Deutsch  &  Deutsch,  1963;  Moray,  1959;  Norman,  1976;  Treisman,  1960,  1964)? 
Should  its  cognitive  processor  have  structural  decision  and  response-selection  bottlenecks  (cf. 
Allport,  1987;  Allport,  Antonis,  &  Reynolds,  1972;  Kahneman,  1973;  Moray,  1967;  Navon  & 
Gopher,  1979;  Neumann,  1987;  Pashler,  1984,  1994a;  Welford,  1952,  1959,  1967;  Wickens,  1984)? 
Should  its  motor  processors  use  discrete  independent  movement  features  (cf.  Abrams  &  Jonides, 
1988;  Ghez,  Hening,  &  Favilla,  1990;  Goodman  &  Kelso,  1980;  Meyer  &  Gordon,  1985; 
Rosenbaum,  1980)?  The  apwers  to  such  questions  will  not  always  be  patently  obvious.  Depending 
on  what  particular  assumptions  are  selected  for  implementation  in  various  components  of  an  overall 
architecture,  different  prospective  UTCs  each  may  explain  many  observable  aspects  of  overt 
behavior  quite  well,  making  it  difficult  to  determine  which  UTC  is  most  correct. 

Turing  tar  pit.  Escaping  this  difficulty  is  especially  problematic  because  of  the  so-called 
"Turing  tar  pit"  (Newell,  1992).  In  computational  modeling,  the  available  programming  languages 
for  UTCs  are  all  powerful  enough  to  simulate  the  same  general  symbolic  transformations  and  input- 
output  functions  (cf.  Turing,  1937).  Thus,  at  an  abstract  level,  each  UTC  may  have  functional 
capabilities  similar  to  those  of  its  competitors,  which  can  make  alternative  theories  indistinguishable 
in  many  respects,  thereby  trapping  theorists  in  a  sticky  conceptual  morass.  Experimentalists 
therefore  should  expect  to  encounter  a  partial  non-identifiability  problem  when  devising  empirical 
tests  among  competing  alternatives. 

Degrees-offreedom problem.  Closely  related  to  the  non-identifiability  problem  and  Turing  tar 
pit  is  the  degrees-of-freedom  problem  (Newell,  1990,  1992).  By  constmction,  a  UTC  necessarily 
has  many  parameters  whose  values  can  change  from  one  context  to  the  next.  Within  and  across 
particular  contexts,  the  number  of  "free"  parameters  may  exceed  the  total  degrees  of  freedom  in 
available  data  sets.  If  so,  then  the  theory  will  be  underdetermined  with  respect  to  the  data;  good 
quantitative  accounts  provided  by  the  theory  will  not  prove  definitively  that  the  theory  is  apt  or 
informative.  In  formulating  a  UTC,  care  therefore  must  be  taken  to  impose  principled  constraints  on 
the  theory's  potentially  free  parameters. 
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Bridges  Over  Troubled  Waters 

Fortunately,  there  are  a  number  of  supportive  heuristic  principles  for  bridging  the  troubled 
waters  that  must  be  crossed  along  the  way  to  a  complete  veridical  UTC.  Some  of  these  principles 
can  help  especially  to  overcome  the  necessity  of  selection,  Turing  tar  pit,  and  degrees-of-freedom 
problem  in  designing  the  theory's  functional  architecture.  Other  principles  constrain  specific  models 
that  may  be  formulated  to  perform  particular  tasks  on  the  basis  of  the  architecture. 

Maintenance  of  architectural  simplicity  and  stability.  To  overcome  the  degrees-of-freedom 
problem,  a  UTC’s  functional  architecture  should  be  kept  as  simple  and  stable  as  possible.  The 
theorist  must  refrain  from  embellishing  the  architecture  with  empirically  unsubstantiated, 
computationally  arbitrary,  unnecessarily  complex,  or  seductively  vague  mechanisms.  For  example, 
initially  postulating  an  immutable  structural  response-selection  bottleneck  (Pashler,  1994a;  Welford, 
1959,  1967)  or  reservoir  of  divisible  limited  capacity  (Kahneman,  1973)  in  the  architecture's 
cognitive  processor  seems  inadvisable  (Allport,  1987;  Neumann,  1987;  Wickens,  1991).  As  Allport 
forewarned: 

"Obviously  there  is  a  problem  of  how  we  know  when  we  are  dealing  with  competition  for  a 
single  resource....  Once  one  accepts  the  idea  of  general-purpose  processing  capacity  [or  central 
bottlenecks]  as  a  working  hypothesis,  it  becomes  temptingly  easy  to  assume,  without  further 
ado,  that  almost  any  instance  of  dual-task  interference  is  a  result  of  competition  for  this  same 
general  resource,  for  'attention'....  The  theory,  at  least  in  its  application,  appears  to  be  entirely 
circular....  The  result  is  a  strategy  of  research  that  can  do  nothing  but  chase  its  own  tail....  This 
[strategy]  has  been  singularly  unproductive  ...  for  the  discovery  of  the  architectural  constraints 
on  concurrent  psychological  processes....  It  merely  soothes  away  curiosity  by  the  appearance  of 
having  provided  an  explanation,  even  before  the  data  have  been  obtained"  (Allport,  1980, 
pp.  117-118, 121). 

Given  Allport's  provisoes,  the  architecture  of  a  UTC  ought  to  include  only  mechanisms  that  have 
firm  a  priori  physical  or  mental  grounds,  such  as  those  known  to  be  inherent  in  either  the  human 
body's  sensors  and  effectors  or  intelligent  mind's  basic  computational  needs  (Newell,  1990).  Then 
fewer  degrees  of  freedom  will  clutter  the  theoretical  landscape  and  compromise  assessments  about 
goodness-of-fit  to  empirical  data. 

Such  benefits  likewise  may  accrue  through  keeping  the  numerical  parameters  of  the  architecture 
constant  insofar  as  possible.  For  example,  suppose  that  in  two  task  contexts,  the  same  stimuli  or 
responses  are  involved.  If  so,  then  setting  the  parameters  of  the  architecture's  perceptual-motor 
processors  to  have  identical  values  across  both  contexts  will  reduce  the  degrees-of-freedom  problem 
considerably.  Similar  beneficial  constraints  can  be  imposed  on  the  values  of  cognitive-processor 
parameters  (Meyer  &  Kieras,  1997a,  1997b). 

Embodiment  of  perceptual-motor  mechanisms.  As  the  preceding  discussion  also  implies, 
UTCs  can  gain  more  power  and  testability  from  being  physically  "embodied".  If  a  UTC  makes 
direct  contact  with  the  real  world  through  explicitly  represented  perceptual  and  motor  mechanisms, 
then  the  theory's  degrees  of  freedom  at  a  cognitive  level  are  likely  to  be  reduced,  and  the  Turing  tar 
pit  of  abstractly  conceivable  but  concretely  implausible  alternative  computational  algorithms  may  be 
circumvented.  Again  Allport  has  summarized  the  basic  point  quite  well: 

"The  constraints  of  the  human  body  set  upper  limits  on  the  degrees  of  freedom  of  our  physical 
action.  A  limb  cannot  be  in  two  positions  at  once.  We  cannot  shift  our  gaze  simultaneously  to 
right  and  left,  nor  vocalize  two  different  syllables  at  the  same  time....  Certainly,  many  of  the 
phenomena  attributed  hitherto  to  'attentional'  or  'general-capacity'  limitations  can  be  seen  to 
depend  on  situations  in  which  separate  inputs  compete  for  or  share  control  of  the  same  category 
of  action....  It  may  be  that  until  we  have  a  better  description  of  what  is  being  done  by  at  least 
some  of  the  sub-systems,  [other]  questions  about  the  overall  architecture  will  just  be  premature" 
(Allport,  1980,  pp.  144, 145,  148). 
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A  prospective  UTC  therefore  should  have  an  array  of  well-defined  perceptual  processors  for  the 
principal  stimulus  input  modalities,  and  motor  processors  for  the  principal  response  output 
modalities  should  be  included  too.  Some  companion  chapters  in  Attention  an^  Performance  XVII 
are  especially  relevant  in  this  regard  because  they  contribute  substantially  to  explicating  how  these 
perceptual-motor  processors  function. 

Respect  for  neurophysiological  plausibility.  To  be  taken  seriously,  a  prospective  UTC  must 
have  neurophysiological  plausibility  as  well.  Ultimately,  whatever  architectural  components  are 
postulated  in  the  theory  ought  to  accord  with  mechanisms  in  the  "biological  band"  of  human 
information  processing  (Newell,  1990,  1992;  cf.  Footnote  10).  This  requirement  further  justifies 
omitting  inunutable  structural  decision  and  response-selection  bottlenecks  in  the  cognitive  processor 
of  the  theory’s  architecture.  Instead,  the  architecture  should  enable  substantial  asynchronous 
distributed  parallel  information  processing  (cf.  Rumelhart  &  McClelland,  1986).  As  Neumann 
argued: 

"[There  is  no]  physiologically  established  limit  on  the  information  that  can  be  picked  up  at  the 
same  time.  Neither  are  there  obvious  neurophysiological  grounds  for  the  assumption  that  dual¬ 
task  performance  is  limited  by  the  hardware  properties  of  the  brain.  [Instead]  there  is  an 
immense  amount  of  parallel  computation  going  on  simultaneously  in  the  awake  brain  (see  J.  A. 
Anderson  &  Hinton,  1981;  Creutzfeldt,  1983);  and  there  are  many  subsystems  that  integrate 
information  from  different  sources  without  an  indication  of  limited  capacity"  (Neumann,  1987, 
pp.  362). 

Sensitivity  to  task  demands.  Another  crucial  principle  for  formulating  computational  models  of 
multiple-task  performance  involves  being  fully  sensitive  to  the  logical  demands  of  the  tasks  at  hand. 
By  doing  so,  the  set  of  plausible  models  that  can  perform  the  tasks  will  become  considerably  smaller 
and  more  well  defined.  As  a  result,  precise  thorough  accounts  that  have  minimal  degrees  of  freedom 
may  quickly  emerge  for  available  data. 

For  example,  we  (Meyer  &  Kieras,  1997a,  1997b)  have  found  this  with  respect  to  the 
psychological  refractory-period  (PRP)  procedure,  which  entails  performing  two  discrete  RT  tasks  in 
rapid  succession  (Bertelson,  1966;  Kantowitz,  1974;  Pashler,  1994a;  Smith,  1967).  Under  the  PRP 
procedure,  the  instructions  to  participants  typically  demand  that  one  task  be  "primary"  and  the  other 
"secondary";  the  participants  are  supposed  to  produce  rapid  accurate  primary-task  responses  before 
making  secondary-task  responses.*’  These  requirements  strongly  constrain  which  models  can 
account  for  RT  data  from  the  PRP  procedure,  especially  when  the  models'  architecture  has  no 
inherent  structural  cognitive  response-selection  bottleneck.  The  PRP  procedure  mandates  that  in 
essence,  viable  models  must  emulate  such  a  bottleneck  at  some  point  during  the  course  of  secondary- 
task  performance  even  though  one  would  not  otherwise  be  needed. By  honoring  this  mandate,  our 
research  has  achieved  accurate  accounts  of  RT  data  from  not  only  the  PRP  procedure  but  also  other 
related  dual-task  paradigms. 

Application  ofGOMS  analysis.  Useful  guidelines  for  representing  the  logical  consequences  of 
procedural  instructions  and  task  demands  are  provided  by  GOMS  analysis  (Card  et  al.,  1983;  John, 
1990;  John  &  Kieras,  1996;  John,  Vera,  &  Newell,  1994;  Kieras,  1988;  Newell,  1990).  GOMS 
stands  for  goals,  operators,  methods,  and  selection  rules.  In  this  analytical  technique,  the  first  step 


’  ’  For  example,  in  a  study  by  Pashler  (1984,  Exp.  1),  "the  subject  was  instructed  to  respond  as  quickly  as  possible  to 
both  tasks  in  the  two-task  blocks,  with  the  restriction  that  the  first  stimulus  must  be  responded  to  before  the  second"  (p. 
365).  Similarly,  in  a  study  by  Pashler  and  Johnston  (1989),  subjects  were  told  that  they  "should  respond  as  rapidly  as 
possible  to  the  first  stimulus,"  and  "the  experimenter  emphasized  the  importance  of  making  the  first  response  as 
promptly  as  possible"  (p.  30). 

The  emulation  of  a  bottleneck  is  required  because  without  one,  secondary-task  responses  might  be  selected  and 
produced  before  primary-task  responses,  violating  the  instructions  of  the  PRP  procedure  (Meyer  &  Kieras,  1997a). 
According  to  this  logic,  the  function  of  the  emulated  bottleneck  is  to  delay  secondary-responses  enough  that  out-of-order 
responding  never  happens. 
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involves  identifying  and  organizing  the  goals  and  subgoals  for  all  present  tasks,  as  dictated  by  the 
prevailing  physical  environment,  instructions  about  task  priorities,  and  so  forth.  Next,  procedural 
methods  are  formulated  so  that  these  goals  and  subgoals  may  be  achieved.  As  part  of  this 
formulation,  sequences  of  operators  (i.e.,  perceptual,  cognitive,  and  motor  transformations)  are 
chosen  from  an  inventory  provided  by  the  UTC’s  functional  architecture.  The  choice  of  operators  is 
governed  by  explicit  selection  rules  that  tailor  the  operator  sequence  to  be  sufficient  and  efficient. 

For  example,  these  rules  may  invoke  the  rationality  principle  (Anderson,  1990, 1993;  Card  et  al., 
1983;  Newell,  1990),  according  to  which  a  system's  operations  should  have  maximum  expected 
utility.  We  have  found  GOMS  analysis  to  be  especially  useful  in  modeling  performance  of  some 
relatively  complex  practical  tasks  associated  with  HCI  (Kieras  &  Meyer,  1997;  Kieras,  Wood,  & 
Meyer,  1997).  In  contrast,  failure  to  apply  GOMS  analysis  may  leave  the  theorist  trapped  by  the 
morass  of  the  Turing  tar  pit,  as  Newell  (1973a)  originally  forewarned: 

"the  same  human  subject  can  adopt  many  radically  different  methods  for  the  same  basic  task, 
depending  on  goals,  background  knowledge,  and  minor  details  of  payoff  stmcture....  To  predict 
a  subject  you  must  know:  (1)  his  goals;  and  (2)  the  task  environment....  [Until  these  factors  are 
taken  into  account]  we  will  not  be  able  to  bring  the  problem  of  specifying  subjects'  methods 
under  control"  (Newell,  1973a;  pp.  293, 299,  301). 

Compliance  with  real-time  constraint.  Finally,  the  functional  architecture  of  a  UTC  must 
comply  with  the  real-time  constraint  on  immediate  behavior  (Newell,  1990, 1992).  People  perform 
elementary  tasks  through  relatively  simple  combinations  of  operations  at  the  lower  end  of  the 
"cognitive  band",  producing  reaction  times  on  the  order  of  100  ms  <  RT  <  1000  ms.  Having  to 
describe  such  rapid  performance  in  terms  of  mechanisms  that  also  respect  known  properties  of  their 
underlying  neural  "wetware"  provides  more  signposts  for  circumventing  the  Turing  tar  pit  and 
degrees-of-freedom  problem. 

Taxonomy  of  Lessons 

The  remainder  of  this  chapter  outlines  a  number  of  lessons  that  we  have  learned  from  our 
adherence  to  the  aforementioned  heuristic  principles  for  developing  a  unified  theory  of  cognition  and 
action.  Two  types  of  lesson  are  summarized  subsequently:  methodological,  and  substantive.  The 
substantive  lessons  highlight  specific  new  empirical  facts  and  theoretical  conclusions  about  the 
human  information-processing  system.  The  methodological  lessons  highlight  inherent  nuiances  of 
the  scientific  enterprise  whereby  UTCs  and  computational  models  of  multiple-task  performance  may 
be  formulated. 

Lists  of  the  methodological  and  substantive  lessons  appear  in  Tables  1  and  2,  respectively.  So 
that  these  lessons  may  be  more  memorable,  we  adopt  the  classical  pedagogic  practice  (e.g., 

Saunders,  1757)  of  expressing  them  with  brief  epigrammatic  statements  adapted  from  various 
familiar  sources.  Our  hope  is  that  by  learning  about  the  trials  and  tribulations  through  which  we 
have  been  taught,  other  experimental  psychologists,  cognitive  scientists,  and  human-factors 
engineers  will  make  faster  progress  toward  understanding  human  multiple-task  performance. 

For  example,  three  instructive  methodological  lessons  from  our  efforts  thus  far  should  be 
apparent  in  light  of  the  preceding  discourse: 

Methodological  Lesson  1:  Now  is  the  hour.  The  moment  has  come  to  make  more  progress 
toward  theoretical  unification  in  the  scientific  study  of  human  performance.  Ample  data  and 
theoretical  concepts  are  available  to  support  such  an  advance.  Persistent  neglect  of  theoretical 
unification  will  waste  precious  resources  and  postpone  utilitarian  transfer  of  theory  and  data  from  the 
laboratory  to  practical  real-world  applications. 
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Table  1 

Methodological  Lessons 


Lesson 

Number 

Source  of  Lesson 

Epigrammatic  Statement 

1 

Newell  (1973,  1990,  1992) 

"Now  is  the  hour" 

2 

Newell  (1990,  1992) 

"Reaction  time  is  of  the  essence" 

3 

Newell  (1973,  1990, 1992) 

"No  pain,  no  gain" 

4 

PRP  procedure 

"Our  cup  runneth  over" 

5 

PRP  procedure 

"Expect  the  unexpected" 

6 

PRP  procedure 

"Be  careful  what  you  ask  for;  you  might  actually  get  it" 

7 

AEG  models^ 

"Average  at  your  own  risk" 

8 

AEG  models^ 

"Seek  and  you  shall  find; 
knock  and  it  shall  be  opened  unto  you" 

9 

NYNEX  study  of  TAOs*’ 

"Unification  enables  application" 

10 

NYNEX  study  of  TAOs^" 

"Keep  your  sunny  side  up" 

11 

NRL  cockpit  study‘s 

"If  you've  seen  one,  you've  seen  'em  all" 

12 

NRL  cockpit  study‘s 

"Be  thankful  for  The  Second  Golden  Rule" 

13 

Feynman  (1985) 

"Psychological  science  can  (and  will)  be  fun" 

14 

Gopher  (1996) 

"There's  nothing  so  useful  as  a  good  theory" 

^  See  Lauber  et  al.  (1994);  Meyer  et  al.  (1995);  Meyer  &  Kieras  (1997b);  Schumacher  et  al.  (1997). 
^  See  Kieras,  Wood,  &  Meyer  (1995,  1997). 

^  See  Kieras  &  Meyer  (1995,  1997). 
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Table  2 

Substantive  Lessons 


Lesson 

Number 

Source  of  Lesson 

Epigrammatic  Statement 

1 

SRD  modeP 

"Response  selection  is  not  like  pouring  bottled  wine" 

2 

SRD  modeP 

"It's  difficult  to  leap  before  you  look" 

3 

AEC  models*^ 

"Variety  is  the  spice  of  life" 

4 

AEC  models*’ 

"Wherever  there's  a  will,  there  are  ways" 

5 

AEC  models’’ 

"You  can  teach  young  dudes  new  tricks" 

6 

Schumacher  et  al.  (1997) 

"Dual-task  performers  can  share  and  share  alike" 

7 

NYNEX  study  of  TAOs'= 

"TAGS  know  the  way" 

8 

NYNEX  study  of  TAOs'’ 

"What  goes  around  comes  around" 

9 

NYNEX  study  of  tags'’ 

"Hand  movements  obey  the  Boy  Scout  motto" 

10 

NYNEX  study  of  tags'’ 

"Practice  makes  (nearly)  perfect" 

11 

NRL  cockpit  study*^ 

"Daring  task  scheduling  is  The  Right  Stuff" 

12 

NRL  cockpit  study'* 

"The  eyes  have  it" 

13 

NRL  cockpit  study*^ 

"Covert  shifts  of  visual  attention  are  like  poor  Yorick" 

^  Based  on  EPIC  computational  modeling  of  mean  RTs  from  Hawkins  et  al.  (1979). 

^  Based  on  EPIC  computational  modeling  of  mean  RTs  from  PRP  procedure  by  Lauber  et  al.  (1994). 
^  Based  on  EPIC  computational  modeling  by  Kieras,  Wood,  &  Meyer  (1995,  1997). 

Based  on  EPIC  computational  modeling  by  Kieras  &  Meyer  (1995, 1997). 
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Methodological  Lesson  2:  Reaction  time  is  of  the  essence.  The  development  of  complete 
veridical  UTCs  will  benefit  greatly  from  RT  data  that  experimental  psychology  has  amassed  while 
studying  the  performance  of  various  basic  tasks.  These  data,  which  impose  strong  real-time 
constraints  on  a  UTCs  functional  architecture,  are  abundant  and  sorely  in  need  of  further  integration. 
Attention  and  Performance  XVII  together  with  other  volumes  of  this  symposium  series  therefore  can 
play  a  cmcial  role  in  fostering  theoretical  unification. 

Methodological  Lesson  3:  No  pain,  no  gain.  Future  steps  toward  veridical  UTCs  will  not  be 
easy.  Considerable  misdirection,  stumbling,  and  frustration  will  occur  along  the  way  due  to  the  vast 
current  inventory  of  potentially  apt  theoretical  constructs,  Turing  tar  pit,  degrees-of-freedom 
problem,  and  other  concomitant  obstacles.  Together  these  hinderances  may  cause  the  adventurous 
theorist  considerable  fatigue  and  suffering,  just  as  Indiana  Jones  encountered  many  daunting 
challenges  in  his  search  for  the  Lost  Ark  of  the  Covenant  (Kasdan  et  al.,  1981).  Yet  the  ultimate 
prize  remains  worth  the  hardships  that  must  be  endured  to  obtain  it;  both  basic  research  and  practical 
applications  will  benefit  enormously  from  the  elegance,  integration,  explanatory  power,  and 
predictive  capability  of  a  veridical  UTC.  Ultimately,  UTCs  are  the  only  way  to  achieve  such 
benefits  (Newell,  1973a,  1990,  1992). 

To  provide  more  context  for  other  subsequent  lessons,  we  next  introduce  our  EPIC  information¬ 
processing  architecture. 


The  EPIC  Architecture 

Figure  1  shows  a  schematic  diagram  of  EPIC.  It  consists  of  interconnected  LISP  software 
modules  for  symbolic  perceptual,  cognitive,  and  motor  information  processing.  We  have  designed 
these  modules  to  emulate  basic  components  of  the  human  information-processing  system  and  to 
provide  a  basis  for  realistic  computational  models  of  multiple-task  performance.  As  acknowledged 
before,  EPIC's  organization  builds  on  previous  work  by  a  number  of  theorists  (e.g.,  J.  R.  Anderson, 
1976, 1983,  1990,  1993;  Card  et  al.,  1983;  Hunt  &  Lansman,  1986;  Laird  et  al.,  1987;  Newell, 

1973a,  1973b,  1990,  1992). 

Architectural  Components 

During  computer  simulations  with  EPIC,  its  perceptual  processors  receive  information  from 
simulated  sensors  that  transduce  stimuli  presented  through  input  devices  (e.g.,  display  screens  and 
headphones)  in  a  virtual  task  environment.  After  specified  parametric  delays,  symbolic  stimulus 
codes  are  sent  by  the  perceptual  processors  to  the  declarative  working  memory  of  EPIC's  cognitive 
processor.  The  cognitive  processor  maintains  the  contents  of  working  memory,  executes  procedures 
for  performing  various  tasks,  and  instructs  the  motor  processors  by  sending  them  symbolic  response 
codes  about  what  actions  to  take.  The  motor  processors  prepare  and  produce  movements  by 
simulated  effectors  that  operate  output  devices  (e.g.,  keyboards,  joysticks,  and  microphones)  in  the 
virtual  task  environment. 

Together,  EPIC  and  its  task  environment  provide  a  basis  for  modeling  multiple-task 
performance  in  a  variety  of  contexts.  The  components  of  the  architecture  are  tailored  to  be  generally 
applicable  and  consistent  with  available  empirical  data  about  the  nature  of  the  human  information¬ 
processing  system  (e.g.,  Atkinson  et  al.,  1988;  Boff,  Kaufman,  &  Thomas,  1986;  Meyer  & 
Komblum,  1993;  Woodworth  &  Schlosberg,  1954).  Using  the  architecture  and  computational 
models  based  on  it,  a  theorist  literally  can  watch  a  simulated  performer  do  single  or  multiple 
perceptual-motor  and  cognitive  tasks,  just  as  an  experimenter  observes  the  performance  of  a  real 
person. 
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Figure  1.  Diagram  of  the  Executive-Process/Interactive-Control  (EPIC)  information-processing 
architecture. 
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Perceptual  processors.  In  EPIC,  the  visual,  auditory,  and  tactile  sensory  modalities  each  has  its 
own  perceptual  processor.  The  inputs  to  the  perceptual  processors  are  stimuli  transduced  by 
simulated  eyes,  ears,  and  haptic  receptors.  For  example,  EPIC's  eyes  have  retinas  with  foveal, 
parafoveal,  and  peripheral  regions,  so  the  quality  of  the  inputs  to  the  visual  perceptual  processor 
depends  on  the  retinal  locations  of  stimuli.  The  perceptual  processors  produce  symbolic  codes  for 
stimulus  features  such  as  the  locations,  shapes,  sizes,  colors,  and  identities  of  visual  objects,  which 
are  sent  to  modality-specific  partitions  of  working  memory  that  contain  various  "object  files"  (cf. 
Treisman,  1988).  Such  perceptual  processing  takes  various  amounts  of  time  to  be  completed.  In 
particular,  stimulus  detection  and  identification  times  are  parameters  for  our  computer  simulations 
(cf.  Woodworth  &  Schlosberg,  1954).  Their  contributions  are  modulated  by  the  fact  that  EPIC's 
perceptual  processors  function  asynchronously  and  simultaneously  with  the  cognitive  processor, 
whose  operations  depend  on  the  contents  of  working  memory. 

Working  memory.  EPIC's  working  memory  has  various  subdivisions  for  storing  symbolic 
information.  Some  subdivisions  of  working  memory  contain  visual,  auditory,  and  tactile  perceptual 
stimulus  codes  in  "object  files"  (cf.  Baddeley,  1986).  Over  time,  these  codes  may  decay  unless 
EPIC's  cognitive  processor  refreshes  them.  Other  subdivisions  of  working  memory  contain  response 
codes  and  efference  copies  of  movements.  Also  stored  in  working  memory  are  control  codes  for  the 
goals,  procedural  steps,  and  status  of  current  tasks,  which  contribute  to  operations  by  the  cognitive 
processor. 

Cognitive  processor.  EPIC's  cognitive  processor  is  programmed  with  production  rules  in  a 
procedural  memory  store.  Each  rule  there  states  that  if  its  specified  conditions  are  true,  then  its 
specified  actions  should  be  executed.  For  a  rule's  conditions  to  be  true,  they  must  match  items 
currently  in  working  memo^.  When  such  matches  occur,  the  rule's  actions  update  the  contents  of 
working  memory  and  send  instructions  to  EPIC's  motor  processors.  In  this  way,  production  rales 
can  be  used  to  perform  various  perceptual-motor  and  cognitive  tasks. 

For  example,  during  a  primary  auditory-manual  choice-reaction  task,  the  following  rale  might 
be  used  by  EPIC's  cognitive  processor  to  instruct  the  manual  motor  processor  that  it  should  prepare 
and  produce  a  keypress  with  the  left  index  finger  in  response  to  an  800  Hz  tone: 

IF 

((GOAL  DO  TASK  1) 

(STRATEGY  TASK  1  IS  IMMEDIATE) 

(AUDITORY  TONE  800  ON) 

(STEP  DO  CHECK  FOR  TONE  800)) 

THEN 

( (SEND-TO-MOTOR  (MANUAL  PERFORM  LEFT  INDEX) ) 

(ADD  (TASK  1  RESPONSE  UNDERWAY) ) 

(ADD  (STEP  WAIT  FOR  TASK  1  RESPONSE  COMPLETION)) 

(DEL  (STEP  DO  CHECK  FOR  TONE  800)) 

(DEL  (AUDITORY  TONE  800  ON) ) ) . 

The  actions  of  this  rale,  which  not  only  instructs  the  manual  motor  processor  but  also  adds  and 
deletes  items  in  working  memory,  would  be  executed  whenever  working  memory  contains  all  of  the 
items  in  the  rule's  conditions.  For  each  current  task  that  EPIC  is  supposed  to  perform,  there  would 
be  a  set  of  such  rules  in  procedural  memory.  Also,  complementing  these  task-rule  sets,  procedural 
memory  may  contain  sets  of  executive-process  rules  that  help  manage  the  contents  of  working 
memory  and  that  coordinate  multiple-task  performance  with  respect  to  prevailing  task  priorities. 

Theoretical  RTs  for  multiple-task  performance  stem  from  additional  properties  of  EPIC's 
cognitive  processor.  During  performance  simulations,  task  and  executive  production  rales  are 


At  present,  the  executive  and  task  production  rules  for  EPIC  computational  models  are  placed  in  procedural  memory 
by  the  theorists  who  use  this  simulation  system  (e.g.,  Meyer  &  Kieras,  1997a,  1997b).  EPIC,  unlike  some  other 
architectures  (cf.  J.  R.  Anderson,  1983;  Laird  et  al.,  1987),  does  not  yet  have  the  capability  to  acquire  perceptual-motor 
and  cognitive  skills  through  on-line  procedural-learning  algorithms.  Nevertheless,  in  future  research,  we  may  augment 
EPIC  with  such  capabilities. 
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applied  by  the  production-rule  interpreter  of  the  cognitive  processor,  which  is  based  on  a 
Parsimonious  Production  System  (PPS;  Covrigaru  &  Kieras,  1987).  Under  PPS,  the  production-rale 
interpreter  operates  through  a  series  of  processing  cycles,  whose  individual  durations  vary 
stochastically  around  a  mean  value  that  typically  equals  50  ms.'^  At  the  start  of  each  cycle,  the 
conditions  of  all  rales  currently  in  procedural  memory  are  tested  against  the  present  contents  of 
working  memory.  At  the  end  of  each  cycle,  for  every  rale  whose  conditions  completely  match  the 
contents  of  worldng  memory,  all  of  the  rale's  actions  are  executed.  Given  the  complexity  of 
representative  multiple-task  performance,  several  successive  processing  cycles  may  be  required  to 
complete  each  of  two  or  more  concurrent  tasks,  yielding  EPIC's  theoretical  RTs. 

At  present,  however,  we  assume  that  there  is  no  limit  on  how  many  production  rules  can  have 
their  conditions  tested  and  actions  executed  during  a  processing  cycle.  EPIC's  cognitive-processor 
cycle  durations  have  the  same  distribution  regardless  of  how  many  production  rales  are  involved.  In 
this  respect,  our  system  architecture  therefore  differs  radically  from  some  past  theoretical 
frameworks;  it  does  not  have  an  inherent  structural  cognitive  response-selection  bottleneck  or 
limited  reservoir  of  divisible  processing  capacity  (cf.  Kahneman,  1973;  Moray,  1967;  Pashler,  1984, 
1994a;  Welford,  1959, 1967).  On  the  contrary,  with  appropriate  sets  of  executive  and  task 
production  rales,  EPIC's  cognitive  processor  may  select  responses  and  do  other  operations 
simultaneously  for  concurrent  tasks,  avoiding  between-task  interference  at  this  "central"  level. 

A  meta-theoretical  rationale  for  such  assumptions  appears  in  Meyer  and  Kieras  (1997a).  Our 
reasons  adhere  to  the  principles  of  parsimony  and  neurophysiological  plausibility  espoused  by  other 
proponents  of  unification  in  human-performance  theory  (e.g..  Allport,  1987;  Neumann,  1987; 
Newell,  1990,  1992).  Some  empirical  support  for  the  present  assumptions  about  the  capacity  of 
EPIC's  cognitive  processor  is  provided  by  studies  of  multiple-task  performance  in  which  virtually 
perfect  time-sharing  between  two  tasks  has  occurred  (e.g..  Allport  et  al.,  1972;  Greenwald  & 
Shulman,  1973;  Hirst,  Spelke,  Reaves,  Caharack,  &  Neisser,  1980;  Koch,  1993, 1994;  Shaffer,  1975; 
Wickens,  1984). 

Motor  processors.  Nevertheless,  at  a  "peripheral"  level,  EPIC's  motor  processors  do  act  like 
bottlenecks  similar  to  ones  proposed  by  some  other  theorists  (e.g.,  Kantowitz,  1974;  Keele,  1973; 
Keele  &  Neill,  1978;  Reynolds,  1964).  In  our  architecture,  the  ocular,  manual,  and  vocal  response 
modalities  each  has  its  own  motor  processor.  Typically,  the  inputs  to  the  motor  processors  are 
symbolic  codes  for  responses  that  have  been  selected  by  the  cognitive  processor  with  its  production 
rules.i^  The  outputs  by  the  motor  processors  are  movements  of  simulated  eyes,  hands,  and  mouth, 
which  interact  with  the  virtual  task  environment.  For  example,  the  manual  motor  processor  can 
produce  various  styles  of  hand  movement  such  as  pointing,  key  pressing,  typing,  and  joystick  plying. 
The  ocular  motor  processor  can  produce  eye  movements  through  either  voluntary  cognitive  control 
or  reflexive  perceptual  control  (cf.  Fischer  &  Ramsberger,  1984;  Rafal,  Henik,  &  Smith,  1991; 
Reuter-Lorenz,  Hughes,  &  Fendrich,  1991).  Although  the  ocular,  manual,  and  vocal  motor 
processors  may  all  operate  at  the  same  time,  each  of  them  individually  is  a  single-channel 
mechanism  that  limits  the  overall  rate  of  overt  movements. 

To  be  precise,  we  assume  that  after  receiving  the  symbolic  code  for  a  selected  response,  a  motor 
processor  converts  it  to  elementary  movement  features  that  the  response  should  have  overtly.  For 
example,  a  keypress  by  the  manual  motor  processor  might  have  features  that  specify  the  movement 


The  mean  cognitive-processor  cycle  duration  is  one  of  EPIC's  temporal  parameters  that  remains  the  same  across 
different  simulations.  We  set  it  to  50  ms  because  of  both  theoretical  and  empirical  considerations.  Newell  (1990)  has 
argued  that  theoretically,  the  time  taken  for  testing  the  conditions  and  executing  the  actions  of  generic  production  rules 
like  those  used  here  should  be  on  the  order  of  50  ms  per  rule,  given  known  temporal  constraints  on  the  neural-network 
circuits  whereby  such  operations  occur  at  the  biological  level  (cf.  Footnote  10).  Furthermore,  Kristofferson  (1967)  has 
reported  empirical  results  about  perceptual-successiveness  judgments  and  choice  RTs  that  suggest  a  mean  cognitive- 
processor  cycle  duration  of  about  50  ms.  This  value  is  approximately  the  same  as  the  alpha  rhythm's  mean  zero-crossing 
interval  in  EEG  records  of  brainwave  activity  (Callaway  &  Yeager,  1960;  Kristofferson,  1967;  Ray,  1990). 

•5  Under  some  circumstances,  responses  also  may  be  produced  by  a  motor  processor  on  the  basis  of  sensory  information 
sent  directly  to  it  by  a  perceptual  processor  via  a  pathway  that  by-passes  the  cognitive  processor  and  involves  an 
automatic  "privileged  loop"  (McLeod  &  Posner,  1984). 
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style,  hand,  and  finger  to  be  used  (e.g.,  press  ,  left  ,  index).  Consistent  with  some  empirical 
results  (e.g.,  Abrams  &  Jonides,  1990;  Meyer  &  Gordon,  1985;  Rosenbaum,  1980;  Yaniv,  Meyer, 
Gordon,  Huff,  &  Sevald,  1990),  the  movement  features  for  an  overt  response  are  prepared  serially, 
with  each  feature-preparation  step  consuming  on  the  order  of  50  ms.  After  all  of  the  movement 
features  for  a  response  have  been  prepared,  the  response  is  produced  overtly  through  a  final  initiation 
step  that  likewise  takes  on  the  order  of  50  ms.  Thus,  while  symbolic  response  codes  for  concurrent 
tasks  may  be  selected  in  parallel  by  EPICs  cognitive  processor,  the  production  of  distinct  overt 
responses  by  the  same  motor  processor  would  have  to  be  temporally  staggered,  causing  potential 
between-task  or  "structural"  interference  (cf.  Kahneman,  1973). 

An  especially  important  case  of  such  interference  involves  concurrent  tasks  that  each  require 
manual  responses.  EPIC  has  only  one  motor  processor  devoted  to  preparing  and  initiating 
movements  by  the  two  (i.e.,  right  and  left)  hands.  For  multiple  manual  tasks,  substantial  between- 
t^k  interference  is  therefore  possible  at  the  peripheral  motor  level  even  when  the  two  tasks  utilize 
different  hands  and  different  sensory  modalities.  Effective  coping  with  such  interference  requires 
judicious  supervisory  control.  That  this  control  is  needed  under  these  circumstances  has  been 
demonstrated  amply  by  past  studies  of  manual  movement  production  in  multiple-task  performance 
(e.g.,  Ivry,  Franz,  Kingstone,&  Johnston,  1994,  1996;  McLeod,  1977). 

Contributions  by  Attention  and  Performance  symposia.  From  the  previously  cited  references 
that  helped  stimulate  the  development  of  EPIC,  it  now  should  be  clear  that  past  Attention  and 
Performance  symposia  have  made  major  contributions  to  our  thinking.  Literally  dozens  of  these 
references  have  been  published  as  part  of  this  series,  and  many  of  them  have  entailed  reports  of 
informative  RT  data.  Their  prevalence  in  the  present  chapter  provides  substantial  inspiration  for  our 
first  two  methodological  lessons  (Table  1):  Now  is  the  hour,  and  reaction  time  is  of  the  essence! 

Formulation  of  EPIC  Computational  Models 

Proceeding  further  on  the  basis  of  EPIC,  we  formulate  explicit  computational  models  of 
multiple-task  performance  in  terms  of  complementary  production-rale  sets,  which  specify  the 
operations  of  EPICs  cognitive  processor.  First,  for  each  task  at  hand,  a  distinct  set  of  production 
rules  that  perform  the  task  with  the  architecture's  various  components  must  be  written.  The  task 
production  rules  translate  intermediate  stimulus  codes  to  intermediate  response  codes  and  keep  other 
records  associated  with  the  individual  tasks.  Second,  a  set  of  production  rales  for  a  supervisory 
executive  process  must  be  written.  The  executive  production  rules  adaptively  coordinate  progress  on 
the  individual  tasks  so  that  instructions  about  task  priorities  are  obeyed  and  the  tasks  do  not  disrupt 
each  other  at  peripheral  perceptual-motor  levels.  Such  coordination  is  achieved  by  monitoring  the 
contents  of  working  memory  and  inserting  or  deleting  task  goals  together  with  other  control  items  at 
appropriate  moments  along  the  way.  For  example,  consider  the  following  executive  production  rule: 

IF 

{(GOAL  DO  DUAL  CHOICE  RT  TASKS) 

(STRATEGY  AUDITORY-MANUAL  TASK  1) 

(STRATEGY  VISUAL-MANUAL  TASK  2) 

(VISUAL  CENTER  EVENT  DETECTED  ON) 

(NOT  (TRIAL  UNDERWAY) ) ) 

THEN 

( ( SEND-TO-MOTOR  MANUAL  RESET) 

(ADDDB  (TRIAL  UNDERWAY) ) 

(ADDDB  (GOAL  DO  TASK  1)) 

(ADDDB  (GOAL  DO  TASK  2)) 

(ADDDB  (STRATEGY  TASK  2  MODE  IS  DEFERRED)) 

(ADDDB  (STRATEGY  UNLOCK  ON  MOTOR-SIGNAL  MANUAL  STARTED  LEFT)) 
(DELDB  (VISUAL  CENTER  EVENT  DETECTED  ON) ) 

(ADDDB  (STEP  MOVE  EYES  TO  RIGHT)) 

(ADDDB  (STEP  WAIT-FOR  TASK  1  DONE)))) . 
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This  rule  might  be  applied  to  start  processing  for  primary  and  secondary  choice-reaction  tasks  of  a 
PRP  procedure  while  ensuring  that  primary-task  responses  have  higher  priority  than  secondary-task 
responses.  More  generally,  the  executive  production  rules  for  scheduling  and  coordinating  tasks 
may  change,  depending  on  the  particular  task  combinations,  priorities,  and  subjective  strategies  that 
are  involved.  Our  EPIC  computational  models  of  multiple-task  performance  therefore  extend 
previous  proposals  by  theorists  who  have  emphasized  the  importance  of  supervisory  control  in 
cognition  and  action  (e.g.,  Baddeley,  1986;  Duncan,  1986;  Logan,  1985;  Neisser,  1967;  Norman  & 
Shallice,  1986;  Shallice,  1972). 

Assessment  of  EPIC  Computational  Models 

We  assess  our  EPIC  computational  models  by  simulating  multiple-task  performance  under  test 
conditions  that  mimic  those  in  which  empirical  data  from  human  participants  are  collected.  During 
these  assessments,  an  environment-simulation  program  and  human-simulation  pro^am  are  executed 
conjointly  on  a  computer  workstation.  The  environment-simulation  program  provides  a  sequence  of 
stimulus  inputs  to  the  human-simulation  program  and  receives  a  sequence  of  response  outputs  from 
it,  just  as  an  experimenter  would  test  a  human  participant  by  presenting  real  stimuli  and  observing 
his  or  her  overt  behavior.  The  human-simulation  program  consists  of  the  EPIC  architecture  and 
production-rule  sets  in  EPIC's  cognitive  processor,  which  transform  stimulus  inputs  to  response 
outputs  through  systematic  operations  like  those  outlined  before  (Figure  1).  Both  the  environment- 
simulation  program  and  EPIC's  software  modules  are  written  in  LISP.  The  sets  of  executive  and 
task  production  rules  used  for  the  human-simulation  program  conform  to  the  syntax  of  the  PPS 
production-rule  interpreter  (Covrigam  &  Kieras,  1987).  Also,  as  detailed  elsewhere  (Meyer  & 
Kieras,  1997a,  1997b),  execution  of  the  simulation  programs  entails  setting  the  numerical  values  of 
parameters  in  the  task  environment  and  EPIC  architecture. 

After  each  simulation  run,  EPIC's  outputs  may  be  compared  with  observed  results  from  human 
participants.  Insofar  as  the  simulated  data  do  or  do  not  match  empirical  data,  this  would  suggest  that 
our  models  should  or  should  not  be  taken  as  potentially  veridical  descriptions  of  how  human 
multiple-task  performance  is  achieved.  We  have  found  that  for  at  least  some  models,  good  fits 
between  simulated  and  empirical  data  (e.g.,  RTs  and  error  rates)  may  be  obtained  with  relatively  few 
"free"  parameters. 

Overview  of  Applications 

The  subsequent  sections  of  this  chapter  describe  three  representative  task  domains  for  which  we 
have  formulated  and  applied  some  EPIC  computational  models  of  multiple-task  performance.  These 
domains  include:  (a)  the  PRP  procedure,  a  basic  laboratory  paradigm  that  embodies  some 
fundamental  aspects  of  multiple-task  performance  also  found  under  real-world  circumstances; 

(b)  human-computer  interaction  in  a  practical  context,  the  servicing  of  requests  by  customers  to 
telephone  operators  for  the  initiation  of  collect  phone  calls;  and  (c)  concurrent  visual-manual 
tracking  and  tactical  decision  making  in  military  aircraft  operation,  another  practical  context.  From 
focussing  on  these  diverse  task  domains  and  our  models  for  them,  the  potential  value  of  unification 
in  human-performance  theory  may  become  clearer,  and  more  lessons  relevant  to  the  search  for  a 
unified  theory  of  cognition  and  action  may  emerge. 

Application  to  The  PRP  Procedure 

The  PRP  procedure  is  a  popular  laboratory  paradigm  for  studing  human  multiple-task 
performance  (Bertelson,  1966;  Kantowitz,  1974;  Meyer  &  Kieras,  1997a,  1997b;  Pashler,  1994a; 
Smith,  1967).  Many  chapters  in  Attention  and  Performance  volumes  have  been  based  on  it  (e.g.,  see 
Komblum,  1973;  Koster,  1969;  Meyer  &  Komblum,  1993;  Sanders,  1967, 1970).  Such  popularity 
stems  from  the  PRP  procedure's  simplicity,  fecundity,  and  similarity  to  important  real-world 
situations  in  which  people  must  perform  perceptual-motor  and  cognitive  tasks  concurrently.  It 
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therefore  is  befitting  that  we  have  formulated  our  initial  EPIC  computational  models  to  explain  and 
predict  representative  PRP  data  (Meyer  &  Kieras,  1992, 1994, 1996, 1997a,  1997b;  Meyer  et  al., 
1995).  Any  bona  fide  UTC  should  take  these  data  seriously,  and  as  we  show  later,  doing  so  sets  the 
stage  for  analyses  of  multiple-task  performance  in  more  complex  contexts. 

Methodology 

In  a  representative  experiment  with  the  PRP  procedure,  there  is  a  series  of  discrete  trials  during 
which  two  distinct  tasks  must  be  performed  more  or  less  concurrently.  On  each  trial,  a  warning 
signal  is  followed  by  a  stimulus  (e.g.,  visual  letter  or  auditory  tone)  for  the  first  task.  Given  the 
Task  1  stimulus,  a  participant  must  make  a  fast  and  accurate  Task  1  response  (e.g.,  press  a  finger  key 
or  say  a  word).  Soon  after  the  Task  1  stimulus,  another  stimulus  is  presented  for  the  second  task. 
The  perceptual  modality  and  semantic  categoi^  of  the  Task  2  stimulus  may  differ  from  those  of  the 
Task  1  stimulus.  The  time  between  the  two  stimuli  is  the  stimulus-onset  asynchrony  (SOA),  which 
typically  ranges  between  zero  and  1  s.  Given  the  Task  2  stimulus,  the  participant  must  make  a  fast 
and  accurate  Task  2  response.  The  effector  for  the  Task  2  response  may  differ  from  that  for  the 
Task  1  response.  In  most  cases,  instructions  for  the  PRP  procedure  require  that  Task  1  has  higher 
priority  than  Task  2;  they  may  also  urge  participants  to  make  the  Task  1  response  first  (cf. 

Footnote  11). 

The  experimenter  analyzes  the  Task  1  and  Task  2  RTs  to  assess  how  much  the  two  tasks 
interfere  with  each  other.  Specifically,  mean  Task  2  RTs  may  be  plotted  versus  the  SOA,  forming 
PRP  curves  that  ordinarily  decline  as  the  SOA  increases.  Depending  on  various  methodological 
details,  this  SOA  effect  —  also  called  the  PRP  effect  —  can  either  add  or  interact  with  the  effects  of 
other  factors  (e.g.,  stimulus  discriminability,  response-selection  difficulty,  and  movement 
complexity).  Models  of  multiple-task  performance  should  account  for  the  absolute  magnitudes  of 
the  RTs  and  the  observed  patterns  of  factor  effects  on  them. 

Lessons  from  The  PRP  Procedure 

From  the  PRP  procedure  and  our  efforts  to  formulate  EPIC  computational  models  for 
performance  under  it,  several  substantive  and  methodological  lessons  may  be  learned  (Tables  1 
and  2). 

Methodological  Lesson  4:  Our  cup  runneth  over.  The  literature  of  experimental  psychology 
contains  many  alternative  patterns  of  PRP  curves,  manifesting  various  additivities  and  interactions 
among  the  effects  of  SOA  and  other  factors  on  mean  Task  2  RTs.  Even  when  we  confine  our 
attention  to  factors  that  presumably  affect  just  one  particular  processing  stage  (e.g.,  response 
selection)  for  the  secondary  task,  considerable  diversity  appears  in  the  PRP-curve  patterns  obtained 
through  manipulating  those  factors.  Metaphorically,  our  cup  runneth  over.  The  PRP  procedure 
provides  a  copious  fermentation  of  data  to  be  explained  quantitatively  and  modeled  computationally. 

Substantive  Lesson  1:  Response  selection  is  not  like  pouring  bottled  wine.  Because  of  the 
apparent  diversity  in  PRP-curve  patterns,  a  cmcial  substantive  lesson  has  emerged  as  well.  Results 
from  our  computational  modeling  suggest  that  after  only  moderate  practice,  people  do  not  select 
responses  to  stimuli  through  an  immutable  stmctural  bottleneck  whose  inherent  capacity  limits 
preclude  it  from  dealing  with  more  than  one  task  at  a  time.  Response  selection  is  not  like  pouring 
bottled  wine.  On  the  contrary,  responses  to  two  or  more  stimuli  apparently  can  be  and  are 
sometimes  selected  concurrently,  thereby  enabling  substantial  temporal  overlap  between  streams  of 
processing  for  different  tasks.  Although  in  accord  with  our  initial  assumptions  about  EPIC's 
cognitive  processor,  such  overlap  patently  contradicts  some  past  hypotheses  about  how  multiple-task 
performance  takes  place  under  the  PRP  procedure. 

Methodological  Lesson  5:  Expect  the  unexpected.  This  contradiction  has  taught  us  a  fifth 
methodological  lesson,  which  concerns  the  attitude  that  one  should  adopt  when  pursuing  theoretical 
unification.  Unexpected  conceptual  twists  and  turns  are  to  be  expected  along  the  way.  For  example, 
before  our  journey  with  EPIC  began,  the  best  known  explanation  of  the  PRP  effect  on  secondary- 
task  RTs  was  the  response-selection  bottleneck  (RSB)  hypothesis  (Welford,  1959,  1967;  McCann  & 
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Johnston,  1992;  Pashler,  1984, 1990, 1993, 1994a;  Pashler&  Johnston,  1989).  Past  theorists  have 
argued  for  it  based  on  quasi-additive  effects  of  SOA  and  Task  2  response-selection  factors  in  some 
PRP  experiments.  However,  EPIC  computational  models  of  people's  performance  under  this 
procedure  raise  grave  doubts  about  these  arguments.  Such  abrupt  changes  in  theoretical  direction 
are  likely  to  happen  repeatedly  down  the  road,  because  a  major  benefit  of  seeking  UTCs  is  the 
discovery  of  new  and  uncharted  but  promising  conceptual  territory. 

Methodological  Lesson  6:  Be  careful  what  you  ask  for;  you  might  actually  get  it.  Also 
relevant  here  is  a  sixth  methodological  lesson  that  pertains  directly  to  why  past  theorists  have  been 
misled  in  their  advocacy  of  the  traditional  RSB  hypothesis.  As  mentioned  before,  the  instmctions  to 
participants  under  the  PRP  procedure  request  that  Task  1  responses  have  higher  priority  and  earlier 
onsets  than  Task  2  responses.  This  request  explicitly  constrains  participants  to  use  some  bottleneck 
mechanism  that  restricts  the  "flow"  of  information  processing  for  Task  2,  so  that  Task  2  responses  do 
not  occur  before  Task  1  responses  at  short  SOAs.  For  example,  in  an  EPIC  computational  model, 
precluding  out-of-order  Task  2  responses  requires  imposing  temporary  software  bottlenecks  through 
executive  control.  Theorists  who  advocate  the  traditional  RSB  hypothesis  may  have  been  misled 
because  experimenters  who  adopted  the  PRP  procedure  actually  got  the  kind  of  performance  that 
participants  were  asked  to  produce.  Consistent  with  Methodological  Lesson  6,  care  therefore  must 
be  taken  to  accomodate  the  role  of  task  instructions  in  modeling  multiple-task  performance  (cf. 
Newell,  1973a,  1990). 

An  Instructive  PRP  Study 

We  first  learned  these  lessons  through  formulating  EPIC  computational  models  to  account  for 
results  from  an  instructive  PRP  study  by  Hawkins,  Rodriguez,  and  Reicher  (1979). 

Procedure.  In  Hawkins  et  al.'s  study,  there  were  four  different  primary  tasks,  which  involved 
either  auditory  stimuli  (tones)  or  visual  stimuli  (printed  letters)  and  either  manual  responses 
(keypresses  by  left-hand  fingers)  or  vocal  responses  (spoken  words).  Participants  performed  each 
primary  task  together  with  one  or  the  other  of  two  different  secondary  tasks,  which  involved  either 
two  or  eight  visual  stimuli  (digits)  and  two  manual  responses  (keypresses  by  right-hand  fingers).  For 
each  combination  of  Tasks  1  and  2,  the  SOAs  ranged  from  0  to  1200  ms.  These  manipulations  let 
participants'  RTs  for  the  two  tasks  be  measured  jointly  as  a  function  of  the  SOA,  Task  1  perceptual 
modality.  Task  1  motor  modality,  and  Task  2  response-selection  difficulty  (Sanders,  1980; 

Sternberg,  1969).*®  Thus,  Hawkins  et  al.'s  (1979)  study  provides  a  large  set  of  data  with  which  to 
test  alternative  models  of  basic  human  multiple-task  performance,  thereby  exemplifying  the  riches 
mentioned  before  in  Methodological  Lesson  4. 

Results.  Figure  2  shows  empirical  mean  Task  1  and  Task  2  RTs  reported  by  Hawkins  et  al. 
(1979).  Several  salient  aspects  of  these  data  should  be  noticed. 

First,  consider  the  empirical  mean  Task  1  RTs  (large  unfilled  symbols  on  solid  curves). 
Primary-task  responses  took  longer  when  the  Task  1  stimuli  were  auditory  rather  than  visual 
(Figures  2 A  and  2C  vs.  Figures  2B  and  2D)  and  when  the  Task  1  responses  were  vocal  rather  than 
manual  (Figures  2 A  and  2B  vs.  Figures  2C  and  2D).  These  effects  of  the  Task  1  stimulus  and 
response  modalities  were  essentially  additive.  This  suggests  that  stimulus  encoding  and  movement 
production  for  Task  1  occurred  during  temporally  separate  stages  of  processing  (Sternberg,  1969). 


*®  Reliable  effects  on  response  selection  for  Task  2  presumably  were  caused  by  Hawkins  et  al.’s  manipulation  of  S-R 
numerosity.  To  infer  that  such  effects  occurred  there  is  highly  plausible  from  results  of  previous  research  with  S-R 
numerosity  manipulations  (Schumacher,  Lauber,  Glass,  Zurbriggen,  Gmeindl,  Kieras,  &  Meyer,  1997).  For  example, 
Sternberg  (1969)  found  large  reliable  interactions  between  the  effects  on  mean  RTs  of  S-R  numerosity  and  S-R 
compatibility,  a  prototypical  factor  that  is  believed  to  affect  response  selection.  By  contrast,  Sternberg  (1969)  found 
relatively  little  interaction  between  S-R  numerosity  and  stimulus-legibility  effects,  suggesting  that  S-R  numerosity 
affects  stimulus  encoding  hardly  at  all.  These  and  additional  related  data  imply  that  most,  if  not  all,  of  the  S-R 
numerosity  effect  on  mean  RTs  occurs  during  response  selection  rather  than  other  stages  of  processing  (Brainard,  Irby, 
Fitts,  &  Alluisi,  1962;  Broadbent  &  Gregory,  1965;  Gottsdanker,  1969;  Sanders,  1980;  Theios,  1973). 
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FigMre  2.  Results  based  on  the  PRP  study  by  Hawkins  et  al.  (1979).  Large  symbols  on  solid  curves 
represent  empirical  mean  RTs;  small  symbols  on  dashed  curves  represent  simulated  mean  RTs 
produced  by  the  strategic  response-deferment  (SRD)  model.  Filled  circles  and  triangles  represent 
mean  Task  2  RTs  when  response-selection  in  Task  2  was  respectively  easy  or  hard;  unfilled  circles 
and  triangles  represent  corresponding  mean  Task  1  RTs.  A:  Simulated  versus  empirical  mean  RTs 
for  a  combination  of  auditory-vocal  Task  1  and  visual-manual  Task  2.  B:  Simulated  versus 
empirical  mean  RTs  for  a  combination  of  visual-vocal  Task  1  and  visual-manual  Task  2.  C: 
Simulated  versus  empirical  mean  RTs  for  a  combination  of  auditory-manual  Task  1  and  visual- 
manual  Task  2.  D:  Simulated  versus  empirical  mean  RTs  for  a  combination  of  visual-manual  Task  1 
and  visual-manual  Task  2. 
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Furthermore,  regardless  of  the  Task  1  stimulus  and  response  modalities,  neither  the  SOA  nor  the 
difficulty  of  response  selection  in  Task  2  affected  the  mean  Task  1  RTs  very  much.  This  suggests 
that  consistent  with  typical  instructions  in  the  PRP  procedure,  Hawkins  et  al.'s  (1979)  participants 
almost  invariably  gave  Task  1  priority  over  Task  2. 

Next,  let  us  consider  the  empirical  mean  Task  2  RTs.  Regardless  of  which  Task  1  was  involved, 
the  difficulty  of  Task  2  affected  mean  Task  2  RTs  directly  (Figures  2A  through  2D),  as  we  would 
expect  if  Task  2  response  selection  took  longer  when  Task  2  involved  more  S-R  pairs.  Also, 
regardless  of  which  Task  1  was  involved,  the  SOA  affected  mean  Task  1  RTs  inversely.  Presumably 
this  happened  because  after  the  shorter  SOAs,  some  stage  of  processing  in  Task  2  had  to  be 
postponed  temporarily  until  processing  for  Task  1  progressed  enough  that  Task  2  responses  would 
seldom,  if  ever,  precede  Task  1  responses. 

However,  the  relationship  between  the  effects  of  SOA  and  Task  2  difficulty  on  the  mean  Task  2 
RTs  differed  as  a  function  of  which  Task  1  was  involved.  For  example,  when  a  visual-manual 
Task  1  was  involved  (Figure  2D),  mean  Task  2  RTs  were  affected  almost  additively  by  the  SOA  and 
Task  2  response-selection  difficulty.  Such  additivity  also  tended  to  occur  when  a  visual-vocal 
Task  1  was  involved  (Figure  2B).  Given  locus-of-slack  logic  (McCann  &  Johnston,  1992;  Meyer  & 
Kieras,  1997a,  1997b;  Pashler,  1984),  these  additivities  would  be  consistent  with  the  traditionsJ  RSB 
hypothesis.  Nevertheless,  when  either  an  auditory-vocal  or  auditory-manual  Task  1  was  involved, 
substantial  interactions  occurred  between  the  effects  of  SOA  and  Task  2  response-selection  difficulty 
on  mean  Task  2  RTs  (Figures  2A  and  2C);  the  difficulty  effects  were  considerably  less  at  shorter 
SOAs  than  at  longer  SOAs,  yielding  marked  underadditive  interactions.  By  locus-of-slack  logic, 
such  underadditivity  is  inconsistent  with  the  RSB  hypothesis  (Karlin  &  Kestenbaum,  1968;  Keele, 
1973;  Meyer  &  Kieras,  1997a,  1997b;  Schvaneveldt,  1969).  On  the  contrary,  it  appears  here  that 
response-selection  processes  for  Tasks  1  and  2  took  place  concurrently  after  shorter  SOAs. 

Such  heterogeneous  patterns  of  additivity  and  interaction  involving  manipulations  of  SOA  and 
other  factors  (e.g.,  S-R  compatibility)  that  influence  response-selection  difficulty  for  Task  2  also 
have  been  reported  elsewhere  (e.g.,  Schumacher,  Glass,  Lauber,  Gmeindl,  Woodside,  BCieras,  & 
Meyer,  1996).  Frequent,  but  not  universal,  temporal  overlap  of  the  response-selection  processes  for 
Tasks  1  and  2  therefore  may  occur  under  the  procedure.  Of  course,  this  is  what  led  us  to 
Substantive  Lesson  1,  Methodogical  Lesson  5,  and  Methodogical  Lesson  6  at  the  start  of  the  present 
section. 

Theoretical  implications.  Given  these  lessons  and  the  discoveries  on  which  they  rest,  the 
traditional  RSB  hypothesis  must  be  abandoned.  Instead,  a  new  and  more  apt  account  of  multiple- 
task  performance  that  takes  what  we  have  learned  to  heart  is  needed  for  the  PRP  procedure.  The 
next  subsection  outlines  what  this  new  account  entails. 

Adaptive  Executive-Control  Models 

To  account  quantitatively  for  human  performance  under  the  PRP  procedure,  we  have  formulated 
a  class  of  adaptive  executive  control  (AEG)  models  based  on  the  EPIC  architecture  (Meyer  & 

Kieras,  1996,  1997a,  1997b;  Meyer  et  al.,  1995).  Our  AEC  models  incorporate  executive  processes 
that  flexibly  control  the  extent  to  which  secondary-task  processes  may  overlap  temporally  with 
primary-task  processes.  Figure  3  outlines  how  such  control  is  achieved. 

According  to  this  view,  performance  of  each  task  progresses  through  a  sequence  of  stages, 
including  stimulus  identification,  response  selection,  and  movement  production,  consistent  with 
discrete  stage  models  (Sternberg,  1969;  Sanders,  1980).  An  executive  process  coordinates  progress 
on  the  primary  and  secondary  tasks  by  optionally  postponing  one  or  more  stages  of  processing  for 
Task  2  until  Task  1  has  finished.  The  supervisory  functions  of  the  executive  process  include 

(a)  enabling  the  primary-task  and  secondary-task  processes  to  begin  at  the  start  of  each  trial; 

(b)  specifying  a  temporary  Task  2  lockout  point;  (c)  specifying  a  temporary  Task  1  unlocking  event; 
(d)  waiting  for  the  Task  1  unlocking  event  to  occur;  and  (e)  unlocking  Task  2  processes  so  that  their 
responses  may  be  completed.  Together,  these  functions  ensure  that  instructions  associated  with  the 
PRP  procedure  are  satisfied  (i.e..  Task  1  responses  receive  higher  priority  and  occur  before  Task  2 
responses)  even  though  there  is  enough  cognitive  processing  capacity  to  perform  the  two  tasks 
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Figure  3.  Adaptive  executive  control  (AEG)  models  of  multiple-task  performance  for  the  PRP 
procedure.  Diagonal  lines  with  arrows  that  extend  rightward  from  executive  processes  to  secondary- 
task  processes  indicate  alternative  Task  2  lockout  points  (right-side  ovals).  Diagonal  lines  with 
arrows  that  extend  leftward  from  executive  processes  to  primary-task  processes  indicate  alternative 
Task  1  unlocking  events  (left-side  ovals). 
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concurrently  with  little  or  no  between-task  interference.  Through  the  particular  combination  of 
Task  2  lockout  point  and  Task  1  unlocking  event  that  it  uses,  the  executive  process  can  adjust 
exactly  how  much  priority  is  given  to  Task  1  over  Task  2. 

Task  2  lockout  points.  By  definition,  the  Task  2  lockout  point  is  a  point  during  the  course  of 
Task  2  such  that  when  it  has  been  reached,  further  processing  for  Task  2  stops  temporarily  until 
Task  1  enters  a  "done"  state.  Under  our  AEC  models,  there  are  at  least  three  alternative  Task  2 
lockout  points  (Figure  3,  right-side  ovals),  located  respectively  before  the  onsets  of  stimulus 
identification,  response  selection,  and  movement  production  for  Task  2.  Depending  on  whether  the 
executive  process  uses  a  pre-movement,  pre-selection,  or  pre-identification  lockout  point,  the  Task  2 
processes  would  overlap  more  or  less  with  Task  1  processes  after  short  SOAs. 

Task  I  unlocking  events.  The  amount  of  temporal  ovelap  between  Task  1  and  Task  2  processes 
also  depends  on  which  Task  1  unlocking  event  is  used.  By  definition,  this  is  an  event  during  the 
course  of  Task  1  such  that  when  it  occurs.  Task  1  is  deemed  to  be  "done,"  and  the  executive  process 
permits  processing  for  Task  2  to  progress  beyond  the  Task  2  lockout  point.  Under  our  AEC  models, 
there  are  several  alternative  Task  1  unlocking  events  (Figure  3,  left-side  ovals);  Task  1  may  be 
deemed  "done"  immediately  after  either  its  stimulus-identification,  response-selection,  or  movement- 
production  stage  finishes.  Depending  on  whether  the  executive  process  uses  a  post-  identification, 
post-selection,  or  post-movement  unlocking  event.  Task  2  processes  would  overlap  more  or  less  with 
Task  1  processes  after  short  SOAs. 

Particular  cases.  Overall,  the  class  of  AEC  models  includes  many  particular  cases.  For  each 
possible  combination  of  Task  2  lockout  point  and  Task  1  unlocking  event,  there  is  a  specific  set  of 
executive  production  rules  that  can  implement  this  combination,  achieving  a  certain  preferred 
amount  of  temporal  overlap  between  the  two  tasks.  Which  executive  rule  set  is  used  under  what 
circumstances  would  vary  with  task  instructions,  strategic  goals,  perceptual-motor  requirements, 
prior  practice,  cognitive  style,  and  personal  preference. 

From  this  perspective,  the  choice  of  a  lockout-point  and  unlocking-event  combination  is 
analogous  to  the  choice  of  a  decision  criterion  (beta)  in  signal-detection  theory  (Tanner  &  Swets, 
1954),  which  may  vary  with  the  relative  payoffs  and  costs  assigned  to  one  type  of  response  outcome 
versus  another.  For  example,  some  models  within  the  AEC  class  mimic  a  response-selection 
bottleneck  by  using  a  pre-selection  lockout  point  for  Task  2  and  a  post-selection  unlocking  event  for 
Task  1.  Such  "cautious"  task  scheduling  could  be  preferred  when  prevailing  circumstances  strongly 
encourage  that  Task  1  responses  always  precede  Task  2  responses.  Other  models  within  the  AEC 
class  mimic  a  movement-initiation  bottleneck  by  using  a  post-selection/pre-movement  lockout  point 
for  Task  2  and  a  post  motor-initiation  unlocking  event  for  Task  1  (cf.  De  Jong,  1993;  Keele,  1973). 
Such  "daring"  task  scheduling  could  be  preferred  instead  when  circumstances  strongly  encourage 
that  Task  2  responses  be  produced  almost  as  quickly  as  Task  1  responses. 

Strategic  Response-Deferment  Model 

Among  models  in  the  AEC  class,  one  with  which  we  have  worked  extensively  is  the  strategic 
response-deferment  (SRD)  model.  This  model  is  interesting  and  apt  because  as  each  trial  evolves 
during  the  PRP  procedure,  its  executive  process  first  uses  a  post-response-selection  lockout  point  for 
Task  2  but  later  briefly  imposes  a  pre-response-selection  lockout  point,  depending  on  how  far  Task  2 
processes  have  progressed  by  when  the  prespecified  Task  1  unlocking  event  occurs.  Given  such 
adaptive  executive  control,  mean  Task  2  RTs  produced  by  the  SRD  model  closely  match  various 
patterns  of  empirical  PRP  curves  from  previous  experiments  with  the  PRP  procedure.  The  model’s 
goodness-of-fit  takes  into  account  the  effects  of  both  SOA  and  manipulations  in  the  relative 
difficulties  of  the  primary  and  secondary  tasks  (Meyer  &  Kieras,  1997a,  1997b;  Meyer  et  al.,  1995). 

Details  of  executive  process.  Figure  4  outlines  the  executive  process  of  the  SRD  model  in  more 
detail.  At  the  start  of  each  trial  during  the  PRP  procedure,  the  executive  process  puts  Task  1  in  an 
immediate  response-transmission  mode  and  Task  2  in  a  deferred  response-transmission  mode. 

While  Task  2  is  in  deferred  mode,  the  symbolic  identities  of  Task  2  responses  may  be  selected  and 
sent  to  declarative  working  memory,  but  overt  Task  2  response  movements  are  not  produced  by 
EPIC'S  motor  processors.  This  constraint  is  imposed  by  adding  a  special  control  note  to  working 
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Figure  4.  Steps  taken  by  the  executive  process  of  the  SRD  model  to  unlock  Task  2  processes  for  the 
procedure  after  Task  1  has  been  declared  "done".  Breaks  in  the  vertical  time  lines  shown  by 
diagonal  hash  marks  represent  variable  time  intervals  whose  durations  depend  on  the  SOA  and 
temporal  properties  of  prior  processes. 
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memory,  which  specifies  a  post-selection/pre-movement  lockout  point  for  Task  2.  Putting  Task  1  in 
immediate  mode  lets  its  responses  be  selected  and  sent  to  their  motor  processor  as  quickly  as 
possible  for  movement  production.  This  freedom  is  enabled  by  adding  another  control  note  to 
working  memory.  When  the  Task  1  unlocking  event  occurs  subsequently  (e.g.,  the  overt  Task  1 
response  movement  is  initiated),  the  executive  process  temporarily  suspends  Task  2  and  shifts  it  to 
immediate  mode,  after  which  Task  2  is  resumed.  Following  this  transition,  the  identities  of 
previously  selected  Task  2  responses  may  be  transferred  from  working  memory  to  their  motor 
processor  for  movement  production.  If  response  selection  has  not  yet  finished  for  Task  2  before  it  is 
shifted  to  immediate  mode,  then  subsequently  the  Task  2  production  rules  will  both  select  and  send 
the  identities  of  Task  2  responses  directly  to  their  motor  processor. 

Alternative  paths  of  information  processing  and  RT  equations  for  Task  2.  Because  of  how  its 
executive  process  works,  five  alternative  paths  of  information  processing  (different  sequences  of 
operations)  may  lead  from  Task  2  stimuli  to  Task  2  response  movements  in  the  SRD  model  (Meyer 
&  Kieras,  1997a,  Figures  10  through  13).  Which  path  is  taken  during  a  particular  trial  of  the  PRP 
procedure  depends  on  the  SOA  and  the  relative  difficulty  of  Task  1  versus  Task  2.  Associated  with 
each  path  is  a  distinct  equation  that  defines  the  Task  2  RT  in  terms  of  the  model's  parameters  and  the 
SOA  (Meyer  &  Kieras,  1997a,  Table  3).  Under  some  experimental  conditions,  all  five  paths  and 
equations  contribute  to  the  Task  2  RTs  over  the  interval  of  positive  SO  As.  Under  other  experimental 
conditions,  the  Task  2  RTs  stem  from  only  a  subset  of  these  paths  and  equations.  Consequently,  the 
SRD  model  implies  that  the  SOA  and  other  factors  (e.g.,  response-selection  difficulty  for  Task  2) 
can  affect  mean  Task  2  RTs  either  additively  or  interactively,  depending  on  prevailing  experimental 
conditions  (Meyer  &  Kieras,  1997a,  Figure  15). 

Account  of  mean  RTs  from  Hawkins  et  al.  Given  its  theoretical  implications,  we  have  applied 
the  SRD  model  successfully  in  accounting  for  the  mean  RT  data  of  Hawkins  et  al.  (1979).  A 
summary  of  the  obtained  account  appears  in  Figure  2,  which  shows  simulated  mean  RTs  from  the 
SRD  model  versus  empirical  mean  RTs  as  a  function  of  SOA  for  various  combinations  of  primary 
and  secondary  tasks.  With  respect  to  both  Task  1  and  Task  2,  the  simulated  RTs  fit  the  empirical 
RTs  fairly  well  regardless  of  which  perceptual  and  motor  modalities  were  involved  during  Task  1. 
For  Task  2,  the  simulated  RTs  accurately  approximate  the  interactive  and  additive  effects  of  SOA 
and  response-selection  difficulty  on  the  empirical  RTs.  The  SRD  model  required  relatively  few 
context-dependent  parameters  to  achieve  its  goodness-of-fit;  the  number  of  parameter  values  used 
here  was  markedly  less  than  the  number  of  reliable  one-degree-of-freedom  contrasts  in  the  empirical 
mean  RT  data.  Similarly,  accurate  parsimonius  quantitative  accounts  of  mean  RTs  from  a  variety  of 
other  studies  with  the  Pl^  procedure  (e.g.,  Karlin  &  Kestenbaum,  1968;  McCann  &  Johnston,  1992; 
Pashler,  1990)  have  been  provided  by  the  SRD  model  (Meyer  &  Kieras,  1997a,  1997b). 

Further  Lessons 

Our  SRD  model  and  other  related  members  of  the  AEC  class  also  have  taught  us  more 
substantive  and  methodological  lessons  (Tables  1  and  2). 

Substantive  Lesson  2:  It's  difficult  to  leap  before  you  look.  A  second  substantive  lesson 
concerns  the  role  of  eye  movements  during  concurrent  choice-reaction  tasks.  As  Figure  2  indicates, 
large  underadditive  interactions  between  the  effects  of  SOA  and  response-selection  difficulty  on 
mean  Task  2  RTs,  which  manifest  temporal  overlap  between  concurrent  response-selection 
processes,  occurred  in  the  PRP  study  by  Hawkins  et  al.  (1979)  when  Task  1  involved  auditory 
stimuli  and  Task  2  involved  visual  stimuli.  However,  when  both  tasks  involved  visual  stimuli,  these 


In  some  respects,  the  SRD  model  resembles  the  hybrid  structural -bottleneck  model  of  De  Jong  (1993).  He  proposed 
that  both  response-selection  and  movement-initiation  bottlenecks  mediate  multiple-task  performance,  integrating 
hypotheses  advocated  by  Kantowitz  (1974),  Keele  (1973),  Pashler  (1984,  1994a),  Welford  (1967,  1980),  and  others. 
Similarly,  to  coordinate  progress  on  Tasks  1  and  2  of  the  PRP  procedure,  the  executive  process  of  the  SRD  model  uses 
both  post-response-selection  and  pre-response-selection  lockout  points  for  Task  2.  However,  these  lockout  points  are 
optional,  flexible,  and  adaptively  controlled,  whereas  the  bottlenecks  of  De  Jong’s  (1993)  hybrid  model  are  immutable 
and  insensitive  to  changing  task  requirements. 


27 


Precis  to  A  Unified  Theory 


Meyer  &  Kieras 


SOA  and  difficulty  effects  were  approximately  additive,  suggesting  that  the  selection  of  Task  2 
responses  usually  took  place  after  the  selection  of  Task  1  responses  had  finished.  Why  and  how  did 
this  happen?  Intervening  eye  movements  provide  the  answer. 

When  both  tasks  in  Hawkins  et  al.'s  (1979)  PRP  study  involved  visual  stimuli,  a  large  (>  5°) 
visual  angle  separated  them.  Thus,  on  each  trial  there,  participants  first  had  to  look  at  the  Task  1 
stimulus  and  later  had  to  make  a  time-consuming  saccadic  eye  movement  to  look  at  the  Task  2 
stimulus.  This  intervening  saccade,  which  was  not  needed  when  Task  1  involved  auditory  stimuli, 
presumably  perempted  the  response-selection  processes  for  Tasks  1  and  2  from  temporally 
overlapping.  By  when  the  Task  2  stimulus  had  been  identified  and  Task  2  response  selection  had 
begun  after  the  saccade,  Task  1  response  selection  already  would  have  finished  even  if  the  SOA  was 
very  short.  The  consequences  of  this  constraint  are  embodied  in  the  SRD  model's  simulated  Task  2 
RTs  for  these  conditions  (Figures  2B  and  2D).  We  therefore  learn  here  that  although  procedural 
cognitive  processes  may  have  the  capacity  to  select  responses  concurrently  for  multiple  tasks,  such 
capacity  will  not  be  apparent  if  a  peripheral  perceptual -motor  bottleneck  (a.k.a.  structural 
interference-,  Kahneman,  1973)  precludes  its  benefits. 

Substantive  Lesson  2  is  relevant  to  interpreting  results  from  not  only  Hawkins  et  al.'s  (1979) 
PRP  study  but  also  others'  as  well.  For  example,  McCann  and  Johnston  (1992,  Exp.  2)  reported  a 
study  in  which  the  SOA  and  difficulty  of  response  selection  for  Task  2  affected  mean  Task  2  RTs 
almost  additively.  Again  this  additivity  may  have  stemmed  from  manditory  eye  movements 
intervening  between  the  onsets  of  the  Task  1  and  Task  2  stimuli  (Meyer  &  Kieras,  1997b).  If  so, 
then  contrary  to  McCann  and  Johnston's  conclusions,  their  results  would  not  support  the  traditional 
RSB  hypothesis  per  se;  instead,  a  peripheral  perceptual-motor  bottleneck  could  account  for  them. 
Given  such  considerations,  veridical  UTCs  must  incorporate  realistic  treatments  of  the  contributions 
and  constraints  associated  with  eye  movements  during  human  multiple-task  performance. 

Substantive  Lesson  3:  Variety  is  the  spice  of  life.  A  third  substantive  lesson  from  our  work 
with  the  PRP  procedure  is  that  there  are  interesting  individual  differences  in  how  people  coordinate 
multiple-task  performance.  According  to  our  AEC  models  (Figure  3),  people  may  have  two 
alternative  types  of  strategy  for  scheduling  performance  of  the  primary  and  secondary  tasks  in  the 
PRP  procedure  (Meyer  &  Kieras,  1996, 1997b).  One  type  of  strategy  is  cautious.  Cautious 
scheduling  strategies  use  relatively  early  (e.g.,  pre-selection)  Task  2  lockout  points  and  relatively 
late  (e.g.,  post-movement)  Task  1  unlocking  events,  as  if  there  were  a  cognitive  response-selection 
bottleneck.  This  allows  little  temporal  overlap  between  Task  1  and  Task  2  processes,  increasing 
Task  2  RTs  after  short  SOAs  in  order  to  minimize  the  likelihood  that  overt  Task  2  responses  might 
precede  overt  Task  1  responses  and  thereby  violate  the  PRP  procedure's  requirements.  By  contrast,  a 
second  type  of  scheduling  strategy  is  daring.  Daring  scheduling  strategies  use  relatively  late  (e.g., 
post-selection)  Task  2  lockout  points  and  relatively  early  (e.g.,  pre-movement)  Task  1  unlocking 
events,  consistent  with  there  being  no  response-selection  bottleneck.'*  This  allows  more  temporal 
overlap  between  Task  1  and  Task  2  processes,  decreasing  Task  2  RTs  at  short  SOAs  but  increasing 
the  likelihood  that  overt  Task  2  responses  might  precede  overt  Task  1  responses  and  violate  the  PRP 
procedure's  requirements.  Which  scheduling  strategy  is  adopted  under  what  circumstances 
presumably  depends  on  factors  such  as  the  subjective  difficulties  of  Tasks  1  and  2,  people's  prior 
experience  with  multiple-task  situations,  and  their  personal  preferences  for  conservative  or 
aggressive  task  performance. 

Given  these  considerations,  we  predict  that  under  the  PRP  procedure,  some  participants'  RT  data 
would  manifest  cautious  task  scheduling  and  others'  would  manifest  daring  task  scheduling.  Indeed, 
this  prediction  has  been  confirmed  already  in  a  study  by  Lauber,  Schumacher,  Glass,  Zurbriggen, 
Kieras,  and  Meyer  (1994,  Exp.  2;  also  see  Meyer  et  al.,  1995).  Their  study  replicated  the  one  of 
Hawkins  et  al.  (1979)  with  an  auditory-manual  Task  1  and  visual-manual  Task  2,  except  that  Lauber 
et  al.'s  Task  1  was  more  challenging.'®  This  change  encouraged  more  participants  to  do  cautious 


'*  Thus,  by  definition,  the  task-scheduling  strategy  used  in  the  SRD  model  (Figure  4)  is  daring. 
'®  Here  Task  1  involved  four  rather  than  two  alternative  S-R  pairs  (cf.  Hawkins  et  al,  1979). 
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rather  than  daring  task  scheduling  (Meyer  &  Kieras,  1997b). “  As  a  result,  diverse  patterns  of  PRP 
curves  appeared  in  the  participants'  RT  data. 

Figures  5A  and  5B  illustrate  this  diversity  clearly.  In  Figure  5A  are  the  PRP  curves  of  a 
participant  for  whom  the  effect  of  response-selection  difficulty  on  mean  Task  2  RTs  increased 
reliably  as  the  SOA  decreased.  Here  the  negative  SOA-by-difficulty  interaction  suggests  a  cautious 
scheduling  strategy  that  used  a  hybrid  combination  of  pre-selection  and  post-selection  Task  2 
lockout  points  (Meyer  &  Kieras,  1997b).  By  contrast,  in  Figure  5B  are  the  PRP  curves  of  a  different 
participant  for  whom  the  effect  of  response-selection  difficulty  on  mean  Task  2  RTs  decreased 
reliably  as  the  SOA  decreased.  The  latter  positive  SOA-by-difficulty  interaction  is  exactly  opposite 
to  the  former  one  (cf.  Figure  5A),  suggesting  a  daring  scheduling  strategy  that  consistently  used  a 
post-selection  Task  2  lockout  point  (Meyer  &  Kieras,  1997b).  The  difference  between  the  task¬ 
scheduling  strategies  of  these  two  participants  occurred  even  though  their  mean  Task  1  RTs  were 
similar.^'  This  outcome,  consistent  with  Substantive  Lesson  3,  again  demonstrates  how  striving 
toward  a  veridical  UTC  can  uncover  instructive  new  phenomena  that  otherwise  might  go  unnoticed 
from  the  perspective  of  older  theoretical  viewpoints  such  as  the  traditional  RSB  hypothesis.^ 

Substantive  Lesson  4:  Wherever  there's  a  willf  there  are  ways.  Related  to  the  preceding 
analysis  is  a  fourth  substantive  lesson;  executive  cognitive  processes  and  various  strategies  of  task 
performance  are  everywhere.  Apparently  they  play  crucial  roles  not  only  in  complex  cognitive 
domains  like  memory  (Reitman,  1970)  and  problem  solving  (Newell  &  Simon,  1972)  but  also  in 
seemingly  simpler  domains  like  selective  attention  and  elementary  multiple-task  performance  (cf. 
Moray,  1979).  Of  course,  this  generality  is  not  too  surprising  in  light  of  some  past  precedents. 

For  example,  consider  sensory  psychophysics  (Green  &  Swets,  1966;  Krantz,  1969;  Luce,  1963; 
Tanner  &  Swets,  1954).  At  first,  the  dominant  model  in  sensory  psychophysics  was  high-threshold 
theory  (HTT).  Analogous  to  the  traditional  perceptual  and  response-selection  bottleneck  hypotheses 
(Broadbent,  1958;  Pashler,  1994a;  Welford,  1959,  1967),  HTT  assumed  that  human  observers  detect 
simple  sensory  stimuli  (e.g.,  light  flashes  and  tone  bursts)  through  a  discrete  all-or-none  threshold 
mechanism,  wherein  the  subjective  stimulus  intensity  must  exceed  some  constant  absolute  level  to 
be  detected.  Because  of  this  threshold's  putative  rigidity,  it  provided  little  room  for  observers' 
decision  criteria  and  judgment  strategies.  As  a  result,  many  problematic  psychophysical  data  went 
unexplained. 

Ultimately,  however,  signal-detection  theory  (SDT)  emerged  on  the  scene,  reconciling 
phenomena  that  had  previously  bedeviled  HTT.  Unlike  in  HTT,  no  discrete  absolute  high  threshold 
is  assumed  in  SDT.  Instead,  SDT  characterizes  observers'  detection  performance  in  terms  of 
stochastic  processes  that  involve  a  continuum  of  sensory  states  and  adjustable  decision  criteria. 
According  to  this  characterization,  observers  set  their  decision  criteria  (beta  values)  strategically  to 
achieve  various  preferred  frequencies  of  "hits"  for  stimulus  signals  and  "correct  rejections"  for  noise, 
depending  on  prevailing  reward  schemes.  These  strategic  adjustments  account  well  for  the  forms  of 
receiver  operating-characteristic  (ROC)  curves  and  the  different  points  that  observers  adopt  on  them. 
Similar  to  this  account  is  our  use  of  the  SRD  and  other  AEC  models  for  interpreting  the  forms  of 
PRP  curves  (Meyer  et  al.,  1995;  Meyer  &  Kieras,  1997b).  In  essence,  the  Task  2  lockout  points  and 
Task  1  unlocking  events  of  the  AEC  models  play  much  the  same  conceptual  role  as  do  SDT's 
decision  criteria.  If  the  detection  of  simple  sensory  signals  involves  sophisticated  supervisory 


Because  Lauber  et  al.  (1994,  Exp.  2)  increased  the  difficulty  of  Task  1,  the  chances  of  Task  2  responses  occurring 
prematurely  before  Task  1  responses  were  potentially  greater  unless  their  participants  adopted  more  cautious  scheduling 
strategies  than  did  those  in  Hawkins  et  al.'s  (1979)  study. 

For  each  participant  in  Figure  5,  the  mean  Task  1  RTs  averaged  about  500  ms  and  were  not  affected  much  by  either 
SOA  or  Task  2  difficulty. 

Insofar  as  we  know,  few  advocates  of  the  traditional  RSB  hypothesis  have  checked  for  systematic  individual 
differences  in  task-scheduling  strategies.  Presumably  this  omission  has  occurred  because,  according  to  the  RSB 
hypotheses,  all  participants  have  an  immutable  structural  response-selection  bottleneck,  and  so  their  patterns  of  PRP 
curves  would  not  be  expected  to  differ  much  from  each  other. 
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Figure  5.  Results  from  a  study  with  the  PRP  procedure  by  Lauber  et  al.  (1994,  Exp.  2). 

A.  Mean  Task  2  RTs  as  function  of  SOA  and  Task  2  response-selection  difficulty  for  Participant  1. 

B.  Mean  Task  2  RTs  as  function  of  SOA  and  Task  2  response-selection  difficulty  for  Participant  8. 
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executive  processes,  it  is  only  natural  that  multiple-task  performance  does  too.  The  moral  of  our 
story  is  that  a  veridical  UTC  must  acknowledge  such  processes  and  provide  a  basis  for 
computationally  modeling  the  diverse  performance  strategies  associated  with  them. 

Methodological  Lesson  7;  Average  at  your  own  risk.  That  multiple-task  performance  is 
mediated  by  various  task-scheduling  strategies  has  other  methodological  implications.  Results  from 
paradigms  like  the  PRP  procedure  should  not  be  averaged  across  participants  without  first  checking 
for  systematic  individual  differences  among  them.  As  mentioned  elsewhere  (e.g.,  Estes,  1956; 
Meyer,  Osman,  Irwin,  &  Yantis,  1988;  Siegler,  1987),  foregoing  such  checks  can  lead  to  seriously 
erroneous  conclusions. 

For  example,  consider  Figure  6 A.  Here  we  have  plotted  Task  2  RTs  obtained  by  averaging  over 
a  whole  group  of  participants  who  were  in  the  same  experiment  from  which  Figure  5  came.  After 
such  averaging,  the  mean  Task  2  RT  data  appear  to  form  "parallel"  (vertically  equidistant)  PRP 
curves  that  embody  nearly  additive  effects  of  SOA  and  response-selection  difficulty.  If  only  these 
averaged  data  had  been  considered,  then  one  might  conclude  that  the  participants  all  performed 
according  to  the  traditional  RSB  hypothesis  (cf.  McCann  &  Johnston,  1992;  Pashler,  1984, 1994a; 
Pashler  &  Johnston,  1989).  However,  this  conclusion  would  not  be  correct  with  respect  to  the 
individual  participants'  data  patterns,  as  indicated  by  Figures  5 A,  5B,  and  6B. 

In  particular.  Figure  6B  shows  interactions  between  the  effects  of  SOA  and  response-selection 
difficulty  on  mean  Task  2  RTs  for  each  individual  participant  who  contributed  to  Figure  6A.  Here 
the  distribution  of  interactions  is  rather  diffuse;  one  participant  had  an  approximately  null  interaction 
(i.e.,  additive  effects  of  SOA  and  Task  2  difficulty),  but  three  participants  had  markedly  negative 
interactions  (e.g.,  see  Figure  6B,  Participant  1),  and  four  others  had  various  magnitudes  of  positive 
interaction  including  some  that  were  quite  large  (e.g.,  see  Figure  6B,  Participant  8).  This  is  not  what 
would  happen  if  a  traditional  response-selection  bottleneck  mediated  every  participant's 
performance.  To  the  contrary,  some  participants  apparently  used  task-scheduling  strategies  that 
were  cautious  (i.e.,  ones  without  overlapping  response-selection  processes),  but  others  used 
strategies  that  were  much  more  daring  (i.e.,  ones  with  overlapping  response-selection  processes).^^ 
Averaging  the  Task  2  RT  data  across  these  two  subgroups  of  participants  obscures  the  cmcial 
differences  between  them,  creating  a  deceptive  illusion  that  the  traditional  RSB  hypothesis  has  been 
supported.  Consequently,  one  might  wonder  how  often  such  deceptive  illusions  have  occurred  on 
past  occasions  when  experimenters  advocated  the  RSB  hypothesis  after  finding  seemingly  additive 
SOA  and  Task  2  difficulty  effects  in  averaged  group  PRP  curves. 


23  To  support  this  conclusion  further,  one  may  compare  the  light  vertical  bars  (predicted  interactions)  and  dark  vertical 
bars  (observed  interactions)  in  Figure  6B.  Derivations  described  elsewhere  (Meyer  et  al.,  1995;  Meyer  &  Kieras,  1997b) 
yielded  the  light  vertical  bars,  which  assume  that  every  participant  had  a  response-selection  bottleneck.  Given  this 
assumption,  the  light  vertical  bars  should  approximate  the  dark  bars  closely,  but  they  do  not.  A  large  fraction  (i.e.,  7/8) 
of  the  dark  vertical  bars  in  Figure  6B  are  longer  than  the  light  vertical  bars  paired  with  them,  embodying  consistently 
more  extreme  interactions  than  the  traditional  RSB  hypothesis  predicts.  Specifically,  it  appears  that  Figure  6B  contains 
at  least  two  distinct  subgroups  of  participants,  some  of  whom  (e.g..  Participant  1)  produced  significantly  negative 
interactions  between  the  effects  of  SOA  and  response-selection  difficulty  on  mean  Task  2  RTs,  and  others  of  whom  (e.g.. 
Participant  8)  produced  significantly  positive  interactions.  This  supports  our  claim  that  task  scheduling  involves 
adaptive  executive  control  and  that,  because  of  systematic  personal  preferences,  some  but  not  all  participants  may  adopt 
daring  scheduling  strategies  even  when  Task  1  is  relatively  hard. 

24  Standard  statistical  tests  that  have  been  reported  to  demonstrate  additive  factor  effects  in  averaged  group  PRP  curves 
(e.g.,  McCann  &  Johnston,  1992;  Pashler,  1984;  Pashler  &  Johnston,  1989)  do  not  surmount  the  above  problem.  Their 
power  is  greatly  weakened  by  systematic  individual  differences  in  patterns  of  PRP  curves,  because  these  differences 
inflate  the  subject-by-treatment  interactions  that  serve  as  error  variance  In  the  denominators  of  accompanying  t  and  F 
statistics  (Hays,  1963).  Such  inflation  may  explain  why  some  researchers  have  found  seemingly  unreliable  interactions 
between  the  effects  of  SOA  and  other  factors  that  influence  response  selection  for  Task  2  of  the  PRP  procedure.  For 
example,  McCann  and  Johnston  (1992,  Exp.  1)  reported  F(3,  69)  =  1.94  in  the  case  of  a  positive  interaction  between 
SOA  and  S-R  compatibility  effects  on  mean  Task  2  RTs.  This  interaction  was  marginal  (.05  <  p  <  .10  for  a 
unidirectiontil  hypothesis  test),  but  might  have  been  much  more  reliable  in  the  absence  of  underlying  systematic 
individual  differences,  thereby  further  disconfirming  the  traditional  RSB  hypothesis. 
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Figure  6.  Further  results  from  the  study  with  the  PRP  procedure  by  Lauber  et  al.  (1994,  Exp.  2;  cf. 
Figure  5).  A.  Mean  Task  2  RTs  as  function  of  SO  A  and  Task  2  response-selection  difficulty 
obtained  by  averaging  over  a  group  of  eight  participants.  B.  Magnitudes  of  interaction  between  the 
effects  of  SOA  and  Task  2  difficulty  on  mean  Task  2  RTs  for  the  individual  participants  who 
contributed  to  Figure  6A.  Dark  vertical  bars  represent  the  participants’  observed  interactions.  Light 
vertical  bars  represent  the  distribution  of  predicted  interactions  that  should  have  occurred  if  all 
participants  used  the  same  cautious  strategy  of  task  scheduling  and  the  observed  interactions  differed 
only  b^ause  of  between-trial  RT  variance. 
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Substantive  Lesson  5:  You  can  teach  young  dudes  new  tricks.  Despite  the  preceding  concerns, 
we  have  learned  a  fifth  substantive  lesson;  under  at  least  some  circumstances,  most  if  not  all 
participants  in  a  group  can  be  trained  to  adopt  the  same  daring  scheduling  strategy This  follows 
because  the  Task  2  lockout  points  and  Task  1  unlocking  events  in  our  AEG  models  (Figure  3)  ^e 
presumably  flexible  and  open  to  change  through  appropriate  practice  or  instructional  manipulations. 
These  models  therefore  predict  that  even  participants  who  initially  prefer  cautious  scheduling 
strategies  when  Task  1  is  difficult  may  come  eventually  to  prefer  daring  task  scheduling. 

Such  predictions  are  supported  by  more  results  from  the  study  of  Lauber  et  al.  (1994,  Exp.  3). 
They  gave  eight  new  participants  (ones  not  in  Figure  6)  an  initial  three-day  phase  of  variable-priority 
training,  after  which  the  participants  were  tested  in  a  one-day  assessment  phase  with  the  standard 
PRP  procedure.  The  training  phase  followed  Gopher's  (1993)  suggestions  about  how  to  enhance  the 
efficiency  of  dual-task  performance.  It  required  concurrent  auditory-manual  and  visual-manual  tasks 
to  be  performed  with  equal  priority  and  no  constraints  on  the  serial  order  of  stimuli  and  responses. 
The  relative  difficulties  of  the  auditory-manual  and  visual-manual  tasks  also  varied  orthogonally 
across  trial  blocks.  Because  of  instmctions  given  before  the  training  phase  started,  participants  were 
strongly  encouraged  to  overlap  their  response-selection  processes  for  the  two  tasks  regardless  of 
their  difficulty.  After  the  training  phase  ended,  participants  entered  the  subsequent  assessment 
phase.  It  involved  the  same  PRP  procedure  as  had  yielded  Figure  6,  where  a  hard  auditory-manual 
primary  task  was  combined  with  easy  or  hard  visual-manual  secondary  tasks  (Lauber  et  al.,  1994, 
Exp.  2).26 

Some  results  from  this  assessment  appear  in  Figure  7A.  Here,  although  the  same  relatively  hard 
Task  1  is  involved  as  before,  the  mean  Task  2  RTs  do  not  look  like  those  that  Lauber  et  al.  (1994, 
Exp.  2)  obtained  previously  (cf.  Figure  6A).  After  variable-priority  training,  participants  who 
transferred  to  the  standard  PRP  procedure  had  average  group  PRP  curves  that  embodied  a  strong 
positive  interaction  between  the  effects  of  SOA  and  Task  2  response-selection  difficulty .2?  It 
therefore  appears  that  these  participants  often  selected  their  Task  1  and  Task  2  responses 
concurrently  after  short  SOAs,  as  would  happen  through  a  daring  scheduling  strategy  of  the  type 
assumed  in  the  SRD  model  (Figure  4).  Indeed,  the  present  post-training  RT  pattern  looks  much  like 
what  Hawkins  et  al.  (1979)  obtained  previously  with  the  standard  PRP  procedure  when  Task  1  was 
easier  (cf.  Figure  2C). 

Moreover,  it  appears  that  after  variable-priority  training,  most  if  not  all  participants  performed 
in  about  the  same  fashion.  Figure  7B  shows  interactions  between  the  effects  of  SOA  and  response- 
selection  difficulty  on  mean  Task  2  RTs  for  each  participant  who  received  such  training  (Lauber 
et  al.,  1994,  Exp.  3).  These  participants'  interactions,  unlike  previous  ones  (cf.  Figure  6B),  are 
uniformly  positive.  The  distribution  of  observed  interactions  in  Figure  7B  is  very  similar  to  what 
should  have  occurred  if  every  participant  used  a  daring  scheduling  strategy  through  which  Task  1 
and  Task  2  responses  were  selected  concurrently.  This  outcome  proves  clearly  that  people  can  be 
trained  to  perform  concurrent  tasks  without  an  immutable  structural  response-selection  bottleneck.^^ 
These  findings  likewise  extend  other  similar  ones  obtained  by  Gopher  (1993). 


25  The  participants  here  were  college  students.  Whether  Substantive  Lesson  5  likewise  applies  to  older  adults  remains  an 
important  open  question  for  future  work.  Some  initial  research  suggests  that  fortunately  the  answer  may  be  "yes"  (Glass, 
Lauber,  Schumacher,  Kieras,  &  Meyer,  1997;  Kramer,  1996). 

2^  The  results  in  Figure  6  came  from  participants'  third  session  of  testing  with  a  hard  primary  task  and  easy  or  hard 
secondary  task  in  the  standard  PRP  procedure. 

22  The  corresponding  mean  Task  1  RTs  did  not  vary  much  as  a  function  of  SOA  or  response-selection  difficulty  in 
Task  2;  on  average,  they  equalled  485  ms  and  487  ms  when  Task  2  was  easy  and  hard,  respectively.  These  values  are 
close  to  ones  found  by  Lauber  et  al.  (1994,  Exp.  2)  during  their  previous  experiment  with  a  hard  primary  task  in  the  PRP 
procedure.  Initial  variable-priority  training  seems  not  to  have  influenced  participants'  performance  of  Task  1  much  at  all. 

28  In  contrast,  some  advocates  of  the  traditional  RSB  hypothesis  have  argued  for  it  because  PRP  effects  (dual-task 
interference)  persist  throughout  many  thousands  of  practice  trials  under  the  standard  PRP  procedure  (Gottsdanker  & 
Stelmach,  1971;  Pashler,  1993).  However,  such  persistence  is  not  antithetical  to  our  present  conclusions.  As  mentioned 
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Figure  7.  Results  reported  by  Lauber  et  al.  (1994,  Exp.  3)  for  a  standard  PRP  procedure  with  a  hard 
auditory-manual  Task  1  and  easy  or  hard  visual-manual  secondary  task  after  participants  had  three 
days  of  initial  variable-priority  training.  A.  Mean  Task  2  RTs  as  function  of  SOA  and  Task  2 
response-selection  difficulty  averaged  over  eight  participants.  B.  Magnitudes  of  interaction  between 
the  effects  of  SOA  and  Task  2  difficulty  on  mean  Task  2  RTs  for  individual  participants.  Dark 
vertical  bars  represent  the  participants'  observed  interactions.  Light  vertical  bars  represent  the 
distribution  of  predicted  interactions  that  should  have  occurred  if  all  participants  used  the  same 
daring  strategy  of  task  scheduling  and  the  observed  interactions  differed  only  because  of  between- 
trial  RT  variance. 


already,  the  instructions  about  task  priorities  for  the  standard  PRP  procedure  typically  require  that  Task  1  receive  more 
emphasis  than  Task  2  and  that  Task  1  responses  be  produced  before  Task  2  responses.  Given  such  instructions,  which 
hold  throughout  practice,  some  PRP  effect  always  must  occur  after  short  SOAs  no  matter  how  practice  changes 
participants'  task  scheduling  strategies  in  other  respects.  Thus,  as  before.  Methodological  Lesson  6  ("Be  careful  what 
you  ask  for;  you  might  actually  get  it")  applies  here  too. 
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Substantive  Lesson  6:  Dual-task  performers  can  share  and  share  alike.  Our  sixth  substantive 
lesson  follows  from  an  even  more  extreme  demonstration  of  the  response-selection  bottleneck’s 
ephemeral  nature.  As  our  assumptions  about  the  capacities  of  EPIC's  cognitive  processor  predict,  we 
have  found  that  through  judicious  instruction  and  training,  people  can  perform  two  generic  choice 
RT  tasks  with  essentially  no  between-task  interference  (Schumacher,  Meyer,  Kieras,  Lauber,  & 
Glass,  1997).  Such  virtually  perfect  time  sharing  occurs  when  five  prerequisite  conditions  prevail  in 
combination:  (1)  participants  are  encouraged  to  give  the  tasks  equal  priority;  (2)  each  task  is 
supposed  to  be  performed  quickly;  (3)  there  are  no  constraints  on  the  temporal  relations  and  serial 
order  among  responses;  (4)  performance  of  one  task  uses  different  perceptual  and  motor  processors 
than  does  performance  of  the  other;  and  (5)  participants  receive  enough  practice  to  compile  complete 
production-rale  sets  for  performing  each  task.  Results  obtained  under  these  conditions  confirm  and 
extend  previous  claims  about  the  existence  of  perfect  time  sharing  in  multiple-task  performance 
(e.g..  Allport  et  al.,  1972;  Greenwald  &  Shulman,  1973;  Hirst  et  al.,  1980;  Koch,  1993, 1994; 

Shaffer,  1975;  Wickens,  1984). 

Specifically,  consider  Figure  8,  which  shows  mean  RTs  from  the  fifth  session  of  an  experiment 
that  satisfied  the  aforementioned  conditions  (Schumacher  et  al.,  1997).  In  this  experiment,  there 
were  an  auditory-vocal  RT  task  and  a  visual-manual  RT  task.29  The  experiment  included  three  trial 
types:  dual  task',  heterogeneous  single  task;  and  homogeneous  single  task.  On  dual-task  trials, 
participants  performed  both  tasks  simultaneously,  and  the  stimuli  for  them  had  a  zero  SOA.  On 
heterogeneous  single-task  trials,  only  one  stimulus  (either  auditory  or  visual)  was  presented,  and 
only  one  task  was  performed.  However,  the  heterogeneous  single-task  and  dual-task  trials  were 
interleaved  randomly  within  trial  blocks,  so  before  each  of  these  trials,  participants  were  uncertain 
about  which  task(s)  would  come  next.  In  contrast,  the  homogeneous  single-task  trials  were  arranged 
such  that  the  same  one  task  had  to  be  performed  on  each  trial  throughout  a  block.  Some  blocks  of 
homogeneous  single-task  trials  involved  the  auditory-vocal  task,  and  others  involved  the  visual- 
manual  task.  Although  the  various  trial  types  differed  substantially  in  their  nominal  demands  on 
participants'  information-processing  resources,  the  mean  RTs  and  error  rates  that  resulted  from  them 
were  nearly  equal  (Figure  8).  Furthermore,  the  individual  RTs  for  the  auditory-vocal  and  visual- 
manual  tasks  had  values  that  were  essentially  independent  of  each  other  within  the  dual-task  trials,  as 
perfect  time  sharing  would  entail.  This  outcome  casts  grave  doubt  on  the  traditional  RSB  hypothesis 
but  strongly  affirms  our  theoretical  claim  that  skilled  performers  can  test  the  conditions  and  execute 
the  actions  for  two  sets  of  task  production  rales  concurrently. 

The  results  in  Figure  8  also  raise  another  intriguing  question.  Why  have  past  attempts  of  some 
other  experimenters  (e.g.,  Pashler,  1994b;  Ruthruff,  Pashler,  &  Klaasen,  1995)  who  tried  to  "uncork" 
the  putative  response-selection  bottleneck  been  unsuccessful?  Perhaps  the  answer  is  that  their 
experiments  did  not  satisfy  one  or  more  of  the  prerequisite  conditions  for  perfect  time  sharing.  This 
seems  likely  because  Pashler's  (1994b)  experiment  required  participants  to  perform  two  manual  tasks 
concurrently,  and  Ruthruff  et  al.'s  (1995)  experiment  required  participants  to  produce  their  vocal  and 
manual  responses  in  grouped  fashion,  which  imposed  strong  constraints  on  the  temporal  relations 
among  responses.  Thus,  we  would  expect  that  Aese  experiments  might  produce  seductively 
misleading  evidence  of  central  response-selection  or  peripheral  perceptual-motor  bottlenecks.  The 
repeated  fulfillment  of  such  expectations  reinforces  our  prior  Methodological  Lesson  6:  "Be  careful 
what  you  ask  for;  you  might  actually  get  it".30 


For  the  auditory-vocal  task,  each  stimulus  was  either  a  220,  880,  or  3520  Hz  tone  to  which  participants  responded  by 
saying  either  the  word  "one",  "two",  or  "three",  respectively.  For  the  visual-manual  task,  each  stimulus  was  die  capital 
letter  "O"  displayed  directly  over  either  the  left,  middle,  or  right  dash  in  a  horizontal  row,  to  which  participants 
responded  by  pressing  a  key  with  either  the  index,  middle,  or  ring  finger  of  the  right  hand,  respectively.  These  tasks 
were  similar  to  ones  used  by  Pashler  (1990)  and  others  (e.g.,  McCann  &  Johnston,  1992)  in  testing  the  traditional  RSB 
hypothesis. 

Methodological  Lesson  6  also  is  reinforced  by  another  result  from  the  experiment  of  Schumacher  et  al.  (1997).  After 
our  evidence  of  virtually  perfect  time  sharing  (Figure  8)  had  been  collected,  we  tested  these  same  participants  under  the 
standard  PRP  procedure,  asking  them  to  treat  the  auditory-vocal  task  as  primary  and  the  visual-manual  task  as  secondary 
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Figure  8.  Mean  RTs  from  three  trial  types  in  an  experiment  that  demonstrates  virtually  perfect  time 
sharing  under  certain  prerequisite  conditions  (Schumacher  et  al.,  1997). 


This  yielded  mean  Task  1  RTs  that  remained  short  and  constant  regardless  of  the  SOA,  but  mean  Task  2  RTs  increased 
when  the  SOA  was  short,  substantially  exceeding  those  in  Figure  8.  The  latter  increase  would  be  expected  from 
Methodological  Lesson  6  because  the  PRP  procedure  encourages  participants  to  produce  their  Task  2  responses  after 
Task  1  responses,  which  requires  temporary  postponement  of  progress  on  Task  2  after  short  SOAs. 
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Methodological  Lesson  8:  Seek  and  you  shall  find;  knock  and  it  shall  be  opened  unto  you. 
Equally  relevant  at  this  juncture  is  an  eighth  methodological  lesson;  along  the  path  to  a  veridical 
UTC,  interesting  new  predictions  may  be  derived  theoretically  and  confirmed  empirically.  For 
example,  our  discoveries  of  systematic  individual  differences  in  dual-task  performance  under  the 
PRP  procedure  (Substantive  Lesson  3),  beneficial  effects  of  training  on  participants'  task-scheduling 
strategies  (Substantive  Lesson  5),  and  virtually  perfect  time  sharing  (Substantive  Lesson  6)  all  were 
stimulated  by  predictions  based  on  the  present  AEG  models  and  their  EPIC  architecture.  One  might 
therefore  be  optimistic  that  further  steps  toward  theoretical  unification  will  yield  more  successful 
predictions.  Indeed,  such  optimism  is  justified  by  results  from  subsequent  applications  of  EPIC  to 
modeling  multiple-task  performance  in  other  domains  beyond  the  PRP  procedure. 

Application  to  Human-Computer  Interaction 

Beyond  the  PRP  procedure,  one  practical  domain  to  which  we  have  applied  the  EPIC 
architecture  for  modeling  multiple-task  performance  is  human-computer  interaction  (Kieras  & 

Meyer,  1997;  Kieras,  Wood,  &  Meyer,  1995, 1997).  Financial  support  of  this  application  has  come 
from  NYNEX,  a  regional  telephone  company  in  the  northeastern  United  States.  Given  NYNEX's 
particular  objectives,  our  research  on  HCI  concerns  performance  by  telephone  operators  who 
provide  on-line  service  to  customers  making  phone  calls. 

EPIC  computational  models  are  especially  relevant  in  this  regard.  The  operators'  performance 
requires  processing  auditory  and  visual  information  from  headphones  and  CRT  displays, 
respectively.  On  the  basis  of  such  information,  operators  must  make  decisions  about  incoming  calls 
while  speaking  to  customers  and  typing  on  a  keyboard.  The  interactions  between  operators  and 
customers  are  controlled  by  computer  workstations.  Dealing  with  typical  calls  through  a  workstation 
involves  several  tasks  that  exercise  a  variety  of  perceptual-motor  and  cognitive  resources.  The  speed 
and  accuracy  of  an  operator  in  completing  each  call  depends  both  on  the  capacities  of  these 
resources  and  on  the  workstation's  design.  By  applying  EPIC  computational  models  to  describe  and 
predict  this  performance,  we  may  help  improve  the  design  of  workstations,  the  selection  of 
operators,  and  the  regimens  of  training  that  they  receive.^' 

The  prospects  for  such  help  are  good.  Empirical  studies  of  operators'  performance  have 
revealed  patterns  of  overt  movements  and  response  latencies  consistent  with  EPIC'S  assumptions. 
From  these  patterns,  it  appears  that  considerable  temporal  overlap  occurs  among  concurrent 
perceptual,  cognitive,  and  motor  processes  in  skilled  telephone  operators  (Gray,  John,  &  Atwood, 
1993;  John,  1990).  Thus,  further  substantive  and  methodological  lessons  may  emerge  through 
modeling  these  processes  in  more  detail  (Tables  1  and  2). 

The  NYNEX  Study 

An  initial  example  of  our  research  in  this  domain  focuses  on  performance  by  toll-assistance 
operators  (TAOs),  who  service  customers  for  telephone  calls  that  must  be  charged  to  third-party 
billing  numbers. 

Procedure.  During  such  calls,  NYNEX  investigators  have  made  audio-visual  tapes  of 
representative  on-line  exchanges  between  skilled  TAOs  and  customers.32  These  tapes  contain 
records  of  equipment  signals,  customers'  vocal  inputs,  TAOs'  vocal  outputs,  and  sequences  of 
manual  keypresses  produced  by  TAOs  while  they  were  using  standard  computer  workstations.  From 


The  benefits  from  this  application  could  be  substantial  to  NYNEX.  For  example,  an  improvement  in  the  design  of  an 
interface  that  reduces  the  average  completion  time  per  call  by  1  s  may  decrease  equipment  and  personnel  costs  on  the 
order  of  a  million  dollars  or  more  per  year. 

^2  We  thank  Michael  Atwood  of  the  NYNEX  Science  and  Technology  Center  for  providing  us  with  audio-visual  tapes  of 
TAOs’  on-line  performance.  Helpful  comments  by  Bonnie  John  and  Rory  Stuart  about  our  analyses  and  modeling  of  this 
performance  also  are  gratefully  acknowledged. 
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the  tapes,  the  identities  and  latencies  of  relevant  environmental  and  behavioral  events  can  be 
transcribed  for  data  analysis  and  theoretical  interpretation. 

Each  of  the  taped  calls  required  a  TAO  to  perform  several  steps:  (a)  detecting  the  onset  of  a  tone 
over  a  pair  of  headphones,  which  signalled  that  a  call  from  a  prospective  customer  had  arrived  at  the 
workstation;  (b)  looking  at  a  computer  display  screen  for  alphanumeric  information  that  specified 
the  call's  category;  (c)  greeting  the  customer  who  was  making  the  call;  (d)  getting  the  billing  number 
to  be  charged  for  the  call;  (e)  entering  this  number  and  other  relevant  information  into  the 
workstation  by  typing  a  series  of  keystrokes  on  the  computer's  keyboard;  (f)  looking  at  the  display 
screen  and  checking  that  the  typing  had  been  done  correctly;  (g)  completing  the  connection  for  the 
call  by  pressing  a  call-initiation  key;  and  (h)  bidding  the  customer  good  bye.  The  steps  performed 
by  the  TAOs  were  analogous  to  ones  that  participants  perform  repeatedly  during  PRP  and  serial 
choice-reaction  procedures.  It  therefore  might  be  expected  that  in  at  least  some  respects,  the  TAOs' 
response  latencies  would  resemble  those  found  under  laboratory  conditions. 

Results.  For  example.  Figure  9  shows  results  from  one  representative  exchange  between  a  TAO 
and  customer.  Here  observed  response  latencies  of  the  TAO's  manual  keystrokes  (large  filled  circles 
on  solid  curve)  are  plotted  versus  their  serial  positions  throughout  the  exchange.  Several  features  of 
these  data,  which  typify  third-party  billing  calls,  should  be  noticed.  At  the  start  of  the  keystroke 
sequence  (first  serial  position),  the  observed  response  latencies  rise  to  a  maximum  and  then 
gradually  decrease  thereafter  as  the  serial  positions  of  the  keystrokes  increase.  Consequently,  there 
is  a  downward  latency  trend  that  looks  much  like  the  RT  curves  found  during  past  studies  with  the 
PRP  procedure  (e.g..  Figure  2,  mean  Task  2  RTs),  suggesting  some  postponement  of  pending 
processes  in  order  for  current  processes  to  be  completed.  This  postponement  presumably  was 
necessary  so  that  the  TAO's  keystrokes  occurred  in  correct  serial  order,  just  as  instructions  for  the 
PRP  procedure  require  that  Task  2  responses  occur  after  Task  1  responses. 

The  pattern  of  observed  response  latencies  in  Figure  9  raises  intriguing  theoretical  questions. 
Which  type  of  task-scheduling  strategy  is  used  by  TAOs  to  coordinate  their  performance?  Are  their 
strategies  consistent  with  our  previous  findings  for  laboratory  studies  that  involved  the  PRP 
procedure?  How  might  the  modeling  of  TAOs'  performance  promote  the  search  for  veridical  UTCs? 
Can  workstation  designs  be  improved  on  the  basis  of  insights  from  such  modeling? 

EPIC  Computational  Models  ofTAOs'Petformance 

Some  strides  toward  answering  these  questions  can  be  taken  by  formulating  EPIC 
computational  models  that  account  more  or  less  well  for  data  like  those  in  Figure  9. 

Model  with  cautious  scheduling  strategy.  Among  the  specific  models  that  we  have  tested  thus 
far  is  one  that  uses  a  cautious  scheduling  strategy  (CSS).  According  to  this  CSS  model,  which  has 
an  artificial  response-selection  bottleneck  and  incorporates  the  traditional  RSB  hypothesis,  there  is 
no  temporal  overlap  in  the  response-selection  processes  whereby  TAOs  choose  their  successive 
overt  actions.  Nor  does  the  CSS  model  permit  temporal  overlap  between  the  overt  execution  of 
current  eye  or  hand  movements  and  the  covert  preparation  of  subsequent  eye  or  hand  movements. 
Rather,  the  CSS  model  assumes  that  serially  ordered  movements  are  mediated  by  strictly  sequential 
cognitive  and  motor  processes.  As  a  result,  the  simulated  response  latencies  that  are  produced  by  the 
CSS  model  fit  the  observed  response  latencies  poorly. 

For  example,  the  upper  dotted  curve  in  Figure  9  come  from  the  CSS  model.  Its  simulated 

response  latencies  greatly  exceed  the  TAO's  actual  response  latencies  =  0.04;  RMSE  =1150  ms), 
especially  for  keystrokes  in  early  serial  positions.  The  CSS  model  fails  because  unlike  real  TAOs,  it 
does  not  temporally  overlap  response-selection  or  movement-production  processes  for  successive 
ocular,  manual,  and  vocal  actions.  This  failure  happened  although,  in  other  respects,  we  formulated 
the  CSS  model  to  approximate  the  TAO's  actual  response  latencies  as  best  possible  despite  its 
cognitive  response-selection  bottleneck  and  cautious  scheduling  strategy. 
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Keystroke  Serial  Position 


Figure  9.  Response  latencies  as  a  function  of  keystroke  serial  position  for  a  sequence  of  keystrokes 
typed  by  a  TAO  during  a  representative  exchange  with  a  customer.  Large  filled  circles  on  the  solid 
curve  denote  observed  response  latencies.  Small  filled  circles  on  the  dashed  curve  denote  simulated 
latencies  from  an  EPIC  computational  model  that  used  a  daring  scheduling  strategy.  The  upper 
dotted  curve  denotes  simulated  latencies  from  another  model  that  had  an  artificial  response-selection 
bottleneck  and  used  a  cautious  scheduling  strategy. 


39 


Precis  to  A  Unified  Theory 


Meyer  &  Kieras 


Model  with  daring  scheduling  strategy.  Another  model  that  we  have  tested  in  this  context  is 
one  that  uses  a  daring  scheduling  strategy  (DSS).  According  to  this  DSS  model,  which  generalizes 
our  previous  SRD  model  from  the  PRP  procedure  (Figure  4),  the  TAOs'  successive  overt  actions  are 
mediated  by  response-selection  and  movement-production  processes  that  have  substantial  temporal 
overlap.  As  a  result,  the  simulated  response  latencies  that  are  produced  by  the  DSS  model  fit  the 
observed  response  latencies  fairly  well. 

For  example,  the  lower  dashed  curve  in  Figure  9  comes  from  the  DSS  model.  Its  simulated 
response  latencies  approximate  the  TAO's  actual  response  latencies  closely  at  almost  every  serial 

position  {R^  =  0.93;  RMSE  =  95  ms).33  Of  course,  the  success  of  the  DSS  model  would  be  expected 
from  our  previous  findings  for  the  PRP  procedure,  where  temporally  overlapping  stimulus- 
identification,  response-selection,  and  movement-production  processes  were  demonstrated  through 
the  SRD  model's  goodness-of-fit  (e.g..  Figure  2).  In  the  present  context,  such  temporal  overlap  may 
occur  again  because  TAOs  have  substantial  job  experience  and  want  to  complete  phone  calls  quickly 
so  that  customers  are  satisfied  and  business  costs  stay  low. 

Implications  for  interface  design.  It  is  beyond  the  scope  of  this  chapter  to  discuss  at  length 
how  designers  can  improve  current  computer  workstations  on  the  basis  of  results  like  those  in 
Figure  9.  Nevertheless,  a  few  general  comments  that  bear  on  workstation  design  may  be  offered 
here  (also  see  Gray  et  al.,  1993;  Kieras  et  al,  1997). 

From  Figure  9,  it  appears  that  TAOs  can  work  on  multiple  tasks  concurrently  at  procedural 
cognitive  and  perceptual-motor  levels.  Workstation  interfaces  therefore  should  be  designed  to 
exploit  this  capacity  for  daring  task  scheduling.  For  example,  such  scheduling  may  be  facilitated  by 
designing  interfaces  that  interleave  stimuli  and  responses  in  visual-manual  tasks  with  stimuli  and 
responses  in  auditory- vocal  tasks.  Also  beneficial  may  be  displays  that  present  visual  stimuli  for 
successive  tasks  at  predictable  adjacent  spatial  locations.  These  design  principles  are  important 
because  our  findings  show  them  to  be  prerequisites  for  concurrent  performance  of  multiple  tasks. 

Another  related  point  concerns  the  prospective  role  of  computational  models  in  evaluating 
workstation  interfaces.  Our  results  (e.g..  Figure  9)  show  that  models  with  a  systematic  functional 
^chitecture  for  the  human  information-processing  system  can  precisely  predict  major  facets  of 
interface  users'  performance  (e.g.,  response  speed  and  accuracy).  In  the  future,  as  such  models 
evolve  further,  a  priori  predictions  may  be  made  with  respect  to  both  performance  achieved  through 
different  interfaces  and  performance  achieved  by  different  personal  styles  of  operation.  Thus, 
interface  and  personnel  testing  could  become  both  automated  and  theoretically  motivated  in  ways 
that  save  much  time  and  effort  compared  to  current  engineering  practice. 

Lessons  from  Modeling  of  TAOs'  Performance 

Given  our  theoretical  research  on  the  performance  of  TAOs,  several  more  substantive  and 
methodological  lessons  may  be  educed  (Tables  1  and  2). 

Substantive  Lesson  7:  TAOs  know  the  way.  As  demonstrated  by  our  previous  research  with  the 
PRP  procedure,  the  way  to  perform  multiple  tasks  efficiently  is  by  daring  task  scheduling.  Such 
scheduling  can  occur  because  there  is  no  stmctural  cognitive  bottleneck  through  which  individual 
responses  must  be  selected  sequentially.  On  the  contrary,  the  human  cognitive  processor  -  like 
EPIC'S  “  has  the  capacity  to  select  responses  for  multiple  tasks  concurrently.  Nor  is  there  a  single 
bottleneck  mechanism  through  which  every  selected  response  must  be  produced  overtly.  Rather, 
each  response  modality  has  its  own  motor  processor;  while  the  physical  execution  of  various 


33  The  grand  mean  of  the  observed  response  latencies  in  Figure  9  is  about  1 100  ms.  Relative  to  this  baseline,  the  RMSE 
of  95  ms  constitutes  an  8.5%  error  of  prediction  by  the  DSS  model.  When  working  in  realistic  domains,  engineers 
typically  consider  theoretical  models  to  be  practically  useful  when  they  can  predict  observed  numerical  values  with 
margins  of  error  less  than  10%  (Card  et  al.,  1983).  The  DSS  model  satisfies  this  criterion  whereas  the  CSS  model  does 
not.  Furthermore,  the  fit  of  the  DSS  model  seems  satisfactory  because  the  response  latencies  in  Figure  9  come  from 
single  keystrokes.  Thus,  the  goodness-of-fit  here  is  about  what  one  might  expect  if  the  model  were  correct  but  each 
response  latency  also  contained  approximately  a  10%  contribution  from  perceptual  and  motor  "noise",  which  would  be 
typical  of  practiced  performers  such  as  TAOs. 
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movements  is  underway,  these  motor  processors  may  prepare  the  movement  features  for  subsequent 
overt  responses  anticipatorily.  The  success  of  the  present  DSS  model  in  characterizing  performance 
during  HCI  shows  that  TAOs  also  know  the  way  to  use  such  capabilities. 

Substantive  Lesson  8:  What  goes  around  comes  around.  Complementing  Substantive 
Lesson  7  is  an  eighth  correlative  one.  Through  our  modeling  of  multiple-task  performance  under 
both  simple  laboratory  and  complex  real-world  conditions,  it  has  become  apparent  that  theoretical 
conclusions  reached  in  one  context  often  may  be  generalized  to  other  seemingly  quite  different 
contexts.  For  example,  conclusions  reached  for  the  PRP  procedure  also  apply  to  realistic  human- 
computer  interaction,  and  vice  versa.  Such  generality  should  be  especially  reassuring  to  seekers  of  a 
complete  veridical  UTC. 

Substantive  Lesson  9:  Hand  movements  obey  the  Boy  Scout  motto.  The  motto  of  the  Boy 
Scouts  is  "Be  prepared".  Scouts  who  abide  by  it  are  supposed  to  be  prepared  beforehand  for  making 
fast  and  accurate  responses  under  various  conditions.  Similarly,  our  modeling  of  TAOs'  response 
latencies  (Figure  9)  suggests  that  their  impending  hand  and  eye  movements  are  prepared 
anticipatorily  while  current  overt  responses  are  underway.  It  is  this  anticipatory  movement 
preparation  that  contributes  in  part  to  the  daring  scheduling  strategies  with  which  TAOs  achieve 
efficient  multiple-task  performance  (Kieras  et  al.,  1997).  For  example,  such  preparation  helps 
increase  the  rates  of  manual  typing  and  visual  search  (cf.  Rumelhart  &  Norman,  1982). 

Substantive  Lesson  10:  Practice  makes  (nearly)  perfect.  Consistent  with  our  prior  findings 
about  practice  effects  in  the  PRP  procedure  (Figure  7),  the  efficient  performance  of  TAOs  may  stem 
from  their  extensive  experience  at  servicing  various  types  of  phone  calls  under  conditions  that 
encourage  daring  task  scheduling.  Given  this  experience,  adaptive  executive  processes  can  be 
evolved  to  enable  substantial  flexibility  and  temporal  overlap  among  response  selection,  movement 
production,  and  other  operations  for  concurrent  tasks  (cf.  Gopher,  1993).  Persistent  dual-task 
interference,  which  sometimes  lasts  for  thousands  of  trials  in  the  standard  PRP  procedure,  and  which 
has  misled  some  advocates  of  the  traditional  RSB  hypothesis  (e.g.,  Gottsdanker  &  Stelmach,  1971; 
Pashler,  1993),  is  merely  a  deceptive  stereotypy  due  to  the  constant  differential  task  priorities 
imposed  by  the  PRP  procedure's  instructions. 

Methodological  Lesson  9:  Unification  enables  application.  For  researchers  and  practitioners 
still  uncertain  about  what  approach  ultimately  may  lead  to  the  greatest  practical  payoffs  in 
experimental  psychology,  cognitive  science,  and  human-factors  engineering,  the  extensions  of  our 
EPIC  computational  models  from  the  PRP  procedure  to  the  HCI  tasks  of  TAOs  offer  a  ninth 
instmctive  methodological  lesson.  Efforts  toward  theoretical  unification  and  the  development  of 
UTCs  can  facilitate  the  transfer  of  data  and  hypotheses  from  the  laboratory  to  applied  settings. 
Unification  enables  application.  Of  course,  Newell  (1990)  anticipated  this  in  his  previous 
advertisements  for  UTCs,  and  he  was  correct,  unlike  some  less  optimistic  scientific  leaders  in  the 
field. 

Methodological  Lesson  10:  Keep  your  sunny  side  up.  Indeed,  past  pessimism  has  been  rather 
pervasive  about  applying  laboratory  data  and  hypotheses  to  more  complex  realistic  situations.  For 
example,  consider  what  the  founder  of  the  Attention  and  Performance  symposium  series  had  to  say 
with  respect  to  these  matters; 

"...  generalizations  from  the  simple  to  the  complex  are  sometimes  straightforwardly  wrong.  As 
an  example,  one  cannot  build  a  predictive  model  about,  say,  how  people  type  or  play  the  piano 
on  the  basis  of  the  results  from  research  on  responses  to  signals  that  are  presented  in  rapid 
succession  (the  psychological  refractory  period,  Welford,  1967).  ...  the  "small"  paradigm ...  is 
merely  concerned  with  artificially  created  situations,  irrelevant  to  real-life  performance  and, 
hence,  to  system  design.  ...  the  general  message  [therefore]  is  that  we  should  aim  at  delineating 
the  range  of  application  of  [micro]  behavioural  models  rather  than  searching  for  the  one  ultimate 
[macro]  model"  (Sanders,  1991,  pp.  1006-1008;  also  see  Sanders,  1984). 

In  our  opinion,  however,  such  assessments  and  prescriptions  are  too  pessimistic  and  restrictive. 
On  the  contrary,  applications  of  EPIC  computational  models  jointly  to  the  PRP  procedure  and  HCI 
demonstrate  clearly  that  data  and  theoretical  concepts  from  the  former  "artificial"  laboratory  domain 
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transfer  successfully  to  the  latter  "natural"  real-world  domain.  Although  Sanders  (1991)  merits  high 
respect,  he  was  wrong  about  the  impossibility  of  this  transfer  and  about  the  inadvisability  of  seeking 
a  practical  predictive  (macro  model)  UTC. 

Which  brings  us  to  our  tenth  methodological  lesson:  "Keep  your  sunny  side  up".  Future 
investigators  should  be  optimistic  about  both  the  scientific  and  practical  prospects  for  UTCs.  There 
is  considerable  reason  to  expect  that  further  strides  toward  theoretical  unification  of  basic 
psychological  laboratory  data  and  conceptual  hypotheses  will  promote  applied  human-factors 
engineering  significantly.  In  fact,  more  reason  for  such  optimism  is  provided  by  our  next  application 
of  EPIC  computational  models  to  a  second  real-world  task  domain. 


Application  to  Aircraft  Cockpit  Operation 

A  second  practical  application  of  EPIC  computational  models  deals  with  multiple-task 
performance  during  cockpit  operations  in  military  aircraft  (Kieras  &  Meyer,  1995, 1997;  Meyer  & 
Kieras,  1996, 1997b).  Our  work  there  has  been  conducted  collaboratively  with  scientists  at  the 
Naval  Research  Laboratory  (NRL)  in  Washington,  DC.  This  collaboration  extends  previous 
laboratory  studies  of  visual-manual  tracking  and  serial  choice  reactions  (e.g.,  Brickner  &  Gopher, 
1981;  Gopher,  1993;  Gopher,  Brickner,  &  Navon,  1982;  McLeod,  1977;  North,  1977;  Wickens, 
1976).  Given  NRL's  particular  objectives,  we  have  modeled  the  performance  of  personnel  who 
must  do  visual-manual  tracking  and  tactical  decision  making  concurrently  under  realistic  cockpit 
conditions.  As  in  other  related  contexts  (e.g.,  HCI  by  the  TAOs  of  NYNEX),  such  performance 
relies  on  executive  cognitive  processes  to  coordinate  ocular  and  manual  motor  processes  with  visual 
and  auditory  perceptual  processes.  By  applying  EPIC  computational  models  to  characterize  these 
processes,  we  may  help  improve  the  design  of  cockpit  control  panels,  the  selection  of  personnel,  and 
the  regimens  of  training  that  they  receive.  The  resulting  benefits  could  be  relevant  to  performance  in 
other  real-world  situations  as  well  (e.g.,  concurrent  automobile  driving  and  cellular  telephoning; 
Gugerty,  1997). 

The  NRL  Study 

The  empirical  data  on  which  we  focus  for  now  were  collected  at  the  NRL  by  Balias,  Heitmeyer, 
and  Perez  (1992). 

Procedure.  During  Balias  et  al.'s  (1992)  study,  participants  -  including  some  trained  pilots  - 
viewed  and  responded  to  a  computerized  visual  display  similar  to  ones  used  in  military  aircraft 
cockpits.  A  diagram  of  this  display  appears  in  Figure  10.  It  provided  information  for  performing 
two  tasks  concurrently. 

On  the  right  side  of  the  display  was  a  window  for  a  visual-manual  tracking  task.  In  this  window 
were  a  cursor  and  a  target  (iconic  airplane)  that  moved  evasively  through  space.  When  participants 
performed  the  tracking  task,  they  had  to  keep  the  cursor  on  target  by  moving  a  right-hand  joystick 
that  controlled  the  cursor's  spatial  position.  The  tracking  error  (distance  between  cursor  and  target) 
was  measured  as  a  function  of  the  tracking  task's  difficulty. 

Meanwhile  on  the  left  side  of  the  display  was  a  window  for  a  tactical-decision  task.  In  this 
window  were  iconic  blips  that  appeared  sequentially  at  unpredictable  times  and  locations,  depicting 
potentially  hazardous  objects  (e.g.,  jet  fighters,  bombers,  and  missle  sites)  whose  locations  changed 
gradually  over  time.  When  participants  performed  the  tactical-decision  task,  they  looked  at  these 
blips  one  after  another  and  indicated  which  ones  were  "hostile"  and  which  ones  were  "neutral"  by 
typing  on  a  keyboard  with  their  left  hands.  Response  latencies  of  the  keypresses  were  measured 
individually  for  the  blips  as  a  function  of  their  serial  positions  in  the  sequence  of  tactical  decisions.^^ 


The  response  latency  for  a  blip  equaled  the  amount  of  time  between  two  successive  events:  (a)  the  color  of  the  blip 
changed  from  black  to  red,  blue,  or  amber;  and  (b)  a  key  was  pressed  to  indicate  the  blip's  hostility  status.  Red  blips  had 
to  be  classified  as  "hostile";  blue  blips  had  to  be  classified  as  "neutral";  amber  blips  had  to  be  classified  as  either 
"hostile"  or  "neutral"  in  terms  of  their  position,  direction,  or  speed  of  movement  on  the  display  screen. 
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Figure  10.  Diagram  of  the  visual  display  used  by  Balias  et  al.  (1992)  in  their  study  of  realistic  dual¬ 
task  performance  during  aircraft  cockpit  operation.  On  the  left  is  the  window  for  a  tactical  decision 
task.  There  the  iconic  blips  represent  potentially  dangerous  objects  that  had  to  be  classified  as 
"hostile"  or  "neutral"  .  On  the  right  is  the  window  for  a  visual-manual  tracking  task.  There  the 
crosshairs  of  a  cursor  had  to  be  superimposed  on  a  moving  target  plane  by  manipulating  a  right-hand 
joystick. 
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Performance  of  the  tactical-decision  and  visual-manual  tracking  tasks  occurred  during  two  types 
of  epoch:  single  task,  and  dual  task.  During  single-task  epochs,  participants  had  to  perform  a 
relatively  hard  version  of  the  tracking  task  for  2  min  while  disregarding  the  tactical-decision  task. 
During  dual-task  epochs,  the  tactical-decision  task  and  a  relatively  easy  version  of  the  tracking  task 
both  had  to  be  performed  concurrently  for  2  min,  with  emphasis  on  making  the  tactical  decisions 
quickly.  Participants  alternated  back  and  forth  between  successive  single-task  and  dual-task  epochs. 
Each  transition  between  epochs  was  signaled  by  a  brief  auditory  tone,  with  no  time  separating  the 
end  of  one  epoch  from  the  beginning  of  the  next.  The  sequence  of  alternating  epochs  included  six 
single-task  and  six  dual-task  epochs  per  participant. 

Through  measurements  of  participants'  performance  after  each  transition  from  a  single-task  to 
dual-task  epoch,  Balias  et  al.  (1992)  studied  the  effects  of  adaptive  automation  on  aircraft  cockpit 
operations.  By  definition,  adaptive  automation  involves  two  principal  elements.  First,  a  computer 
system  takes  temporary  responsibility  for  performing  one  or  more  tasks  (e.g.,  tactical  decision 
making)  sO  a  human  operator  can  concentrate  on  another  task  (e.g.,  visual-manual  tracking)  that  has 
become  especially  difficult  at  the  time.  Second,  when  the  task  on  which  the  operator  has  been 
concentrating  becomes  easier  subsequently,  the  operator  continues  performing  it  and,  in  addition, 
resumes  performing  the  other  task(s)  for  which  the  computer  took  temporary  responsibility  before. 
Human-factors  engineers  hypothesize  that  adaptive  automation  can  ease  the  operator’s  mental 
workload  during  stressful  time  periods  while  maximizing  overall  performance  across  a  variety  of 
environmental  conditions. 

However,  adaptive  automation  also  can  have  short-term  detrimental  effects  on  performance. 

For  a  while  after  the  transition  from  an  epoch  of  single-task  performance  to  an  epoch  of  dual-task 
performance,  responses  to  stimuli  in  the  recently  resumed  task  (e.g.,  tactical  decision  making)  may 
be  relatively  slow  or  inaccurate,  manifesting  an  automation  deficit  until  the  operator  "gets  back  into 
the  swing  of  things".  The  causes  of  automation  deficits  are  not  well  understood  yet;  they  perhaps 
involve  temporary  losses  of  situation  awareness  (Graves,  1997;  Gugerty,  1997),  transient  PRP 
effects  of  cautious  task  scheduling,  and/or  disruptive  phasic  changes  in  operators'  level  of  arousal. 
Given  the  design  of  Balias  et  al.'s  (1992)  study,  we  can  examine  the  contributions  that  some  of  these 
sources  make.  Specifically,  by  applying  EPIC  computational  models  to  account  for  data  from  this 
study,  it  is  possible  to  test  whether  an  automation  deficit  stems  from  cautious  scheduling  strategies  in 
the  aftermath  of  transitions  between  single-task  and  dual-task  epochs. 

Results.  Some  empirical  results  for  making  such  tests  appear  in  Figure  1 1.^5  Here  observed 
mean  response  latencies  for  the  tactical-decision  task  are  shown  versus  the  serial  positions  in  which 
iconic  blips  were  classified  following  the  transitions  from  single-task  to  dual-task  epochs.  The 
observed  latencies  tended  to  be  longer  at  the  start  of  the  blip  sequence,  manifesting  an  initial 
automation  deficit,  after  which  they  decreased  as  the  blip  serial  position  increased.  This  latency 
decrease  over  serial  positions  formed  a  downward  curve  that  embodied  a  PRP-like  effect, 
reminiscent  of  mean  Task  2  RTs  in  the  standard  PRP  procedure  (cf.  Figure  2). 

EPIC  Computational  Models  of  Aircraft  Cockpit  Operation 

With  respect  to  the  observed  response  latencies  in  Figure  1 1,  we  have  tested  two  alternative 
EPIC  computational  models:  one  with  a  cautious  scheduling  strategy  (CSS),  and  one  with  a  daring 
scheduling  strategy  (DSS).  Task  scheduling  in  each  model  was  constrained  because  a  large  visual 
angle  (on  the  order  of  20°)  separated  the  centers  of  the  display  windows  for  the  tactical-decision  and 
visual-manual  tracking  tasks  (Figure  10).  This  separation  precluded  participants  from  performing 
more  than  one  task  at  any  particular  moment  during  the  dual-task  epochs,  because  their  eyes  had  to 
fixate  on  one  window  or  the  other  to  acquire  detailed  visual  information  for  the  task  subserved  by 
that  window.  Thus,  under  both  the  CSS  and  DSS  models,  our  simulations  during  dual-task  epochs 
alternated  back  and  forth  between  performing  the  tactical-decision  task  and  performing  the  visual- 


We  thank  James  Balias  and  his  colleagues  at  the  NRL  for  generously  providing  us  with  their  data  and  other  helpful 
information. 
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Blip  Serial  Position 


Figure  11.  Mean  response  latencies  as  a  function  of  blip  serial  position  for  the  tactical-decision  task 
in  the  study  by  Balias  et  al.  (1992).  Large  filled  circles  on  the  solid  curve  denote  observed  latencies. 
Smaller  filled  circles  on  the  lower  dashed  curve  denote  simulated  latencies  from  an  EPIC 
computational  model  that  used  a  daring  scheduling  strategy.  The  upper  dotted  curve  denotes 
simulated  latencies  from  another  model  that  had  an  artificial  response-selection  bottleneck  and  used 
a  cautious  scheduling  strategy.  All  three  latency  curves  come  fi'om  a  sequence  of  tactical  decisions 
made  iimnediately  after  a  period  during  which  only  the  visual-manual  tracking  task  had  been 
performed. 
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manual  tracking  task,  mediated  by  top-level  executive  processes  that  permitted  the  gaze  of  EPIC's 
eyes  to  be  shifted  between  the  left  and  right  display  windows. 

However,  the  two  models  differed  considerably  in  their  assumptions  about  the  processes  that 
performed  the  tactical-decision  task  while  it  was  underway.  Under  the  CSS  model,  there  was  an 
intermediate-level  executive  process  that  used  a  cautious  scheduling  strategy  with  little  temporal 
overlap  among  the  processes  for  making  tactical  decisions  about  the  individual  iconic  blips  in  this 
task's  display  window.  By  contrast,  under  the  DSS  model,  the  intermediate-level  executive  process 
for  the  tactical-decision  task  used  a  daring  scheduling  strategy  whereby  more  temporal  overlap 
occurred  among  the  processes  for  making  tactical  decisions  about  individual  blips. 

CSS  model.  Specifically,  we  programmed  the  CSS  model  with  an  artificial  response-selection 
bottleneck  that  precluded  manual  keypress  responses  for  multiple  iconic  blips  from  being  selected 
concurrently  while  performance  of  the  tactical  decision  task  was  underway.  Also,  under  the  CSS 
model,  processes  that  chose  the  next  blip  to  be  classified  and  that  shifted  EPIC’s  eyes  to  fixate  on  it 
for  stimulus  identification  could  not  begin  until  after  manual  responses  for  the  currently  attended 
blip  had  been  selected  and  produced.  As  a  result,  the  upper  dotted  curve  of  simulated  mean  response 
latencies  in  Figure  1 1  was  produced  for  the  tactical-decision  task  during  dual-task  epochs. 

The  CSS  model's  simulated  latencies  exceeded  the  observed  latencies  (R^  =  0.869;  RMSE  = 

439  ms),  especially  for  blips  in  early  serial  positions.  This  overshoot  happened  although  subject  to 
ancillary  architectural  constraints,  the  CSS  model  was  programmed  to  approximate  the  observed 
latencies  as  best  possible.  Our  simulations  suggest  that  there  is  no  plausible  way  in  which 
participants  could  have  performed  the  tactical-decision  task  as  well  as  they  did  if  their  performance 
had  been  limited  by  a  structural  response-selection  bottleneck  or  cautious  scheduling  strategy. 

DSS  model.  In  contrast,  the  DSS  model  and  its  daring  scheduling  strategy  were  programmed 
such  that  no  response-selection  bottlenecks  precluded  manual  keypress  responses  for  multiple  iconic 
blips  from  being  selected  concurrently  while  performance  of  the  tactical  decision  task  was 
underway.  Also,  under  the  DSS  model,  processes  that  chose  the  next  blip  to  be  classified  and  that 
shifted  EPIC's  eyes  to  fixate  on  it  for  stimulus  identification  began  before  movement  production  for 
the  previously  attended  blip  had  finished.  As  a  result,  the  lower  dashed  curve  of  simulated  mean 
response  latencies  in  Figure  1 1  was  produced  for  the  tactical-decision  task.  Here  the  simulated  and 
observed  response  latencies  are  fairly  close  at  almost  every  serial  position  (R^  =  0.975;  RMSE  = 

90  ms).  The  DSS  model's  goodness-of-fit  in  Figure  1 1  is  similar  to  what  we  obtained  before  for  the 
PRP  procedure  (Figure  2)  and  for  TAOs'  performance  in  HCI  (Figure  9).  Furthermore,  this  model 
accounts  fairly  well  for  participants'  degrees  of  error  in  the  visual-manual  tracking  task  during  both 
single-task  and  dual-task  epochs.  It  therefore  appears  that  EPIC  may  be  applied  for  characterizing 
and  predicting  multiple-task  performance  across  various  realistic  task  domains. 

Implications  for  design  of  cockpit  control  panels.  Results  from  the  present  application  of  our 
EPIC  computational  models  suggest  that  automation  deficits  in  aircraft  cockpit  operations  are  not 
caused  simply  by  cautious  task  scheduling  after  transitions  from  single-task  to  dual-task  epochs. 
Instead,  perhaps  automation  deficits  occur  because  the  visual  displays  for  suspended  tasks  do  not 
sustain  situation  awareness  enough  during  single-task  epochs.  Without  such  sustenance,  operators 
may  need  extra  time  at  the  start  of  dual-task  epochs  in  order  to  create  new  mental  representations  of 
the  current  environment  for  formerly  suspended  tasks. 

If  so,  then  our  results  have  some  obvious  implications  for  designing  cockpit  control  panels: 

(a)  Distances  between  the  display  windows  of  current  and  temporarily  suspended  tasks  should  be 
minimized  while  providing  adequate  discriminability  among  visual  stimuli,  as  in  advanced  "head  up" 
displays,  (b)  Supplementary  perceptual  aids  that  help  sustain  situation  awareness  for  suspended 
tasks  should  be  provided,  (c)  At  the  start  of  dual-task  epochs,  special  visual  cues  should  be  available 
to  orient  operators'  attention  automatically  toward  the  most  important  stimuli  for  resuming  formerly 
suspended  tasks  Perhaps  implementing  such  guidelines  in  combination  will  enhance  the  utility  of 
semi-automated  aircraft  cockpits. 


36  For  example,  in  the  display  of  Balias  et  al.'s  (1992)  tactical-decision  task,  the  iconic  blip  that  should  get  classified  first 
after  the  start  of  each  dual-task  epoch  could  be  blinked  rapidly  on  and  off,  thereby  orienting  the  operator's  attention  to  it 
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Lessons  from  Modeling  Aircraft  Cockpit  Operation 

More  generally,  theoretical  and  practical  benefits  also  may  accrue  from  some  further 
methodological  and  substantive  lessons  that  modeling  of  aircraft  cockpit  operation  has  taught  us. 

Substantive  Lesson  11:  Daring  task  scheduling  is  The  Right  Stuff.  Our  eleventh  substantive 
lesson  concerns  the  nature  of  "The  Right  Stuff  (Wolfe,  1979).  According  to  aviation  lore,  "The 
Right  Stuff  is  what  test  pilots.  Top  Gun  aviators,  and  astronauts  must  have  to  survive  hazardous 
flight  operations.  The  success  of  the  present  DSS  model  suggests  that  at  least  part  of  "The  Right 
Stuff  involves  an  ability  to  use  daring  scheduling  strategies  for  concurrent  performance  of 
perceptual-motor  and  cognitive  tasks  under  duress.  Without  such  task  scheduling,  test  pilots,  Top 
Guns,  and  astronauts  could  not  survive  the  crises  that  they  encounter,  because  it  would  be  impossible 
for  them  to  complete  all  of  their  crucial  tasks  before  fatal  deadlines  have  passed.  That  pilots  like  the 
legendary  Chuck  Yeager  have  survived  to  a  ripe  old  age  weighs  heavily  against  the  traditional  RSB 
hypothesis  (cf.  Yeager  &  Janos,  1985). 

Substantive  Lesson  12:  The  eyes  have  it.  Due  to  the  complex  tasks  performed  in  rapid  aircraft- 
cockpit  operations,  the  daring  scheduling  strategies  used  for  them  presumably  involve  temporal 
overlap  among  a  panoply  of  mental  and  physical  processes,  including  ocular-response  selection, 
saccadic  eye-movement  execution,  visual  stimulus  encoding,  manual-response  selection,  and  finger 
keypressing.  Specifically,  our  successful  DSS  model  implies  that  ocular-response  selection  and 
saccadic  eye-movement  execution  must  overlap  these  other  processes  in  order  to  account  for 
observed  response  latencies;  otherwise,  simulated  latencies  would  far  exceed  observed  latencies. 

This  implication  leads  directly  to  our  twelvth  substantive  lesson:  "The  eyes  have  if,  where  "it"  is 
"The  Right  Stuff  associated  with  daring  task  scheduling.  Contrary  to  claims  by  some  advocates  of 
the  traditional  RSB  hypothesis  (e.g.,  Pashler  Carrier,  &  Hoffman,  1993),  we  have  found  no  evidence 
that  a  structural  response-selection  bottleneck  constrains  ocular-response  selection  for  voluntary 
saccadic  eye  movements. 

Methodological  Lesson  11:  If  you've  seen  one,  you've  seen  'em  all.  Given  the  similarity 
between  what  we  have  found  in  modeling  aircraft  cockpit  operation  and  other  related  cases  of 
multiple-task  performance  (e.g.,  HCI,  and  PRP  procedure),  an  eleventh  methodological  lesson  now 
becomes  apparent.  Having  seen  how  multiple-task  performance  is  achieved  under  one  set  of 
circumstances  may  provide  deep  insights  into  how  it  is  achieved  under  many  other  circumstances. 
Methodological  Lesson  11  therefore  constitutes  a  welcome  generalization  of  our  previous 
Substantive  Lesson  8  ("What  goes  around,  comes  around")  and  Substantive  Lesson  12  ("The  eyes 
have  if).  Seekers  of  theoretical  unification  should  find  this  modicum  of  generality  encouraging, 
because  it  is  an  important  prerequisite  to  formulating  elegant  parsimonius  veridical  UTCs. 

Substantive  Lesson  13:  Covert  visual  attention  shifts  are  like  poor  Yorick.  Nevertheless,  there 
may  be  a  few  exceptions  to  some  of  the  previous  lessons.  In  particular,  our  modeling  of  aircraft 
cockpit  operation  suggests  that  perhaps  Substantive  Lesson  8  and  Methodological  Lesson  11  do  not 
hold  with  respect  to  covert  shifts  of  visual  attention.  By  definition,  covert  visual  attention  shifts 
supposedly  take  place  in  the  "mind's  eye"  rather  than  the  body's  eyes  (Jonides,  1980, 1981);  they 
entail  mentally  redirecting  the  focus  of  attention  from  one  spatial  location  to  another  without 
concomitant  overt  eye  movements.  Such  attention  shifting  has  been  studied  extensively  in  artifical 
laboratory  experiments,  and  substantial  evidence  suggests  that  it  may  influence  visual  information 
processing  there  (Johnston  &  Dark,  1986;  Mangun,  Hillyard,  &  Luck,  1993;  Posner,  1980). 

However,  in  the  DSS  model  for  concurrent  tactical  decision  making  and  visual-manual  tracking,  no 
covert  visual  attention  shifts  contributed  to  its  daring  scheduling  strategy.  This  model  assumed 
simply  that  shifts  of  visual  attention  took  place  through  overt  saccadic  eye  movements  from  one 
relevant  stimulus  location  to  the  next.  Accounting  for  observed  response  latencies  and  tracking 
errors  did  not  require  covert  attention  shifts. 

The  lesson  from  this  is  that  under  some  -  perhaps  even  most  -  real-world  circumstances,  covert 
visual  attention  shifts  are  like  poor  Yorick,  the  deceased  king's  jester  in  Hamlet  (Shakespeare, 


and  helping  him  or  her  make  a  more  rapid  sequence  of  tactical  decisions. 
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1623/1992).  Although  experimental  psychologists  have  known  them  well,  such  attention  shifts  may 
play  little  if  any  role  for  present  practical  purposes.  Instead,  perhaps  what  matter  more  are  the 
ocular-motor  processes  whereby  overt  saccadic  eye  movements  take  place  to  shift  the  eyes'  foveas 
and  gaze  between  locations.^^ 

Methodological  lesson  12:  Be  thankful  for  the  Second  Golden  Rule.  Mandates  from 
organizations  such  as  NYNEX  and  the  U.  S.  Office  of  Naval  Research  (ONR),  the  financial  sponsors 
of  our  work,  also  have  taught  us  that  researchers  should  be  thankful  for  the  Second  Golden  Rule. 

The  Second  Golden  Rule,  an  old  practical  maxim,  states  the  blunt  reality  that  "Those  who  have  the 
gold  make  the  rules."  For  us,  this  has  meant  that  we  have  had  to  focus  on  helping  solve  our 
sponsors'  practical  problems.  As  a  result,  our  research  has  yielded  instructive  insights  about  major 
empirical  phenomena  and  theoretical  processes  in  both  "artificial"  laboratory  and  "natural"  real- 
world  contexts  that  otherwise  would  have  gone  unnoticed.  We  therefore  concur  with  Newell  (1990) 
that  the  press  of  practical  applications  can  have  substantial  benefits  to  basic  science. 


Conclusion 

Reviewing  the  lessons  that  our  computational  modeling  of  human  multiple-task  performance 
has  taught  us,  we  see  that  they  fall  into  several  subcategories. 

Aniong  the  methodological  lessons  (Table  1),  some  concern  attitudes  and  intellectual 
orientations  that  theorists  should  adopt  while  seeking  a  practical  unified  theory  of  cognition  and 
action  (e.g..  Methodological  Lessons  1,  2,  3,  5,  10,  and  12).  Other  methodological  lessons  are 
warnings  about  perilous  pitfalls  along  the  way  to  a  veridical  UTC  (e.g..  Methodological  Lessons  6 
and  7).  In  compensation  for  heeding  such  warnings  and  maintaining  proper  attitude,  additional 
methodological  lessons  offer  promisory  notes  about  major  benefits  that  may  accrue  as  a  result  (e.g.. 
Methodological  Lessons  4,  8,  9,  and  11).  We  hope  that  together,  these  lessons  will  both  encourage’ 
and  guide  researchers  toward  successful  unification  in  human-performance  theory. 

Also  relevant  to  theoretical  unification  are  the  concomitant  substantive  lessons  (Table  2). 
Among  them,  some  describe  basic  characteristics  of  the  human  information-processing  system  that 
have  become  salient  through  our  work  and  that  must  be  incorporated  in  the  functional  architecture  of 
a  veridical  UTC  (e.g..  Substantive  Lessons  1,  2,  6,  and  12).  Other  substantive  lessons  summarize 
important  facts  about  people's  preferred  task-scheduling  strategies  (e.g..  Substantive  Lessons  3,  4,  7, 
8,  9,  11,  and  75).  Given  such  facts,  it  may  be  possible  to  improve  multiple-task  performance 
considerably  with  systematic  training  (Substantive  Lessons  5  and  10).  The  combined  thmst  of  these 
substptive  lessons  reiterates  some  of  Newell's  (1973a,  1990)  original  points:  practical  predictive 
theories  must  take  into  account  "hardware  properties"  of  human  information  processing, 
environinental  task  demands,  personal  goals  based  on  these  demands,  and  preferred  strategies  for 
goal  attainment.  By  doing  so,  future  computational  models  of  multiple-task  performance  may 
realize  the  benefits  anticipated  in  our  methodological  lessons. 

Meanwhile,  important  questions  remain  with  respect  to  the  present  project.  For  example,  how 
close  do  our  EPIC  architecture  and  computational  models  currently  come  to  satisfying  established 
criteria  for  what  a  successful  UTC  should  be?  In  which  directions  should  future  research  go  to 
improve  EPIC  and  to  make  it  a  more  complete  veridical  theory?  Available  space  does  not  permit 
extensive  answers  here,  but  we  may  offer  a  few  partial  ones. 

Evaluation  of  EPIC 

Admittedly,  EPIC  lacks  essential  features  that  UTCs  should  have.  These  omissions  are  apparent 
from  evaluating  EPIC  with  respect  to  requisite  criteria  proposed  by  Newell  (1990, 1992;  cf.  Seifert 
&  Shafto,  1994).  According  to  Newell,  a  complete  veridical  UTC  ultimately  must  yield  accurate 


Experimental  psychologists  who  are  familiar  with  classical  English  literature  as  well  as  both  attention  theory  and 
practical  applications  to  human-factors  engineering  have  assured  us  that  Substantive  Lesson  14  is  indeed  apt  (N.  Moray, 
1996,  personal  communication). 


48 


Precis  to  A  Unified  Theory 


Meyer  &  Kieras 


predictive  computational  models  for  all  of  the  following  functions:  (a)  comprehending  and 
producing  natural  language;  (b)  storing  and  using  large  amounts  of  knowledge;  (c)  learning  from 
experience;  (d)  dealing  with  emotion  and  motivation;  (e)  solving  problems;  (f)  behaving  creatively; 
and  (g)  interacting  socially.  If  so,  then  there  is  a  long  way  to  go  before  EPIC  and  models  associated 
with  it  will  satisfy  these  criteria,  because  the  products  of  our  work  satisfy  none  of  them  yet. 

Nevertheless,  EPIC  and  its  associated  models  do  satisfy  some  of  Newell's  (1990, 1992)  other 
requisite  criteria  already.  These  include  (h)  having  a  precise  and  stable  architecture,  (i)  embodying 
detailed  perceptual  and  motor  mechanisms,  (j)  producing  rational  adaptive  goal-directed  behavior, 

(k)  finishing  tasks  in  real  time,  and  (1)  being  potentially  realizable  as  a  neural  system.  For  example, 
among  EPIC's  noteworthy  features  are  its  motor  processors  for  the  manual  and  ocular  response 
modalities,  and  its  enablement  of  efficient  executive  cognitive  processes  for  task  scheduling.  Also 
noteworthy  is  that  many  of  EPIC's  basic  assumptions  (e.g.,  ones  about  distributed  asynchronous 
parallel  information  processing)  are  consistent  with  human  brain  anatomy  and  neurophysiology 
(Kieras  &  Meyer,  1997;  Meyer  &  Kieras,  1997b;  cf.  Newell,  1990).  Given  these  virtues,  it  seems 
that  further  work  on  EPIC  perhaps  could  be  worthwhile. 

Directions  for  Future  Research 

The  preceding  evaluation  suggests  several  attractive  directions  for  future  research  with  EPIC 
and  new  computational  models  based  on  it. 

Specification  of  subdivisions  in  working  memory.  In  particular,  our  future  research  may 
specify  more  precisely  the  capacities,  durations,  and  interconnections  for  perceptual  and  motor 
subdivisions  of  EPIC's  working  memory.  Such  specifications  can  build  on  previous  studies  of  how 
working  memory  contributes  to  various  cognitive  processes  that  underlie  general  intelligence  (e.g., 
Baddeley,  1986;  Carpenter  &  Just,  1989;  Carpenter,  Just,  &  Shell,  1990;  Chase  &  Ericsson,  1983; 
Daneman  &  Carpenter,  1980;  Gilhooly,  Logie,  Wetherick,  &  Wynn,  1993;  Kyllonen  &  Christal, 
1990).  This  advance  will  help  EPIC  to  support  computational  models  of  reasoning,  problem  solving, 
and  language  processes.  It  also  will  enable  us  to  formulate  more  valid  measures  of  mental  workload, 
whose  current  practical  indicators  leave  much  to  be  desired  (cf.  Donchin  &  Gopher,  1986;  Moray, 
1979;  O'Donnel  &  Eggemeier,  1986;  Wierwille  &  Conner,  1983;  Willeges  &  Wierwille,  1979). 

Elaboration  of  declarative  long-term  memory.  Another  productive  extension  will  involve 
elaborating  EPIC's  long-term  memory  for  declarative  knowledge.  Thus  far,  EPIC  computational 
models  have  used  only  procedural  and  declarative  knowledge  in  working  memory.  However,  EPIC's 
long-term  memory  could  store  large  amounts  of  organized  propositional  declarative  knowledge  as 
well.  Realizing  this  potential  would  move  EPIC  further  toward  supporting  detailed  models  of 
language  processing.  It  also  would  help  set  the  stage  for  a  treatment  of  skill  acquisition,  which 
entails  compiling  procedural  knowledge  on  the  basis  of  task  instmctions  stored  as  propositional 
declarative  knowledge  in  long-term  memory  (J.  R.  Anderson,  1982;  Bovair  &  Kieras,  1991;  Bovair, 
Kieras,  &  Poison,  1990;  Fitts,  1964). 

Treatment  of  skill  acquisition.  We  anticipate  treating  skill  acquisition  thoroughly  in  the  context 
of  EPIC.  If  previous  conceptions  about  perceptual-motor  and  cognitive  skill  are  correct,  then  people 
may  pass  through  several  distinct  acquisition  phases  as  they  become  expert  performers.  For 
example,  J.  R.  Anderson  (1982)  has  proposed  an  initial  declarative  stage  of  skill  acquisition  followed 
by  several  later  procedural  substages  (cf.  Fitts,  1964).  During  the  declarative  stage,  performance  is 
presumably  mediated  by  propositional  knowledge  about  how  a  task  should  be  performed.  Using 
such  knowledge  apparently  requires  slow  controlled  verbal  interpretive  processes  that  lead  indirectly 
to  overt  action.  Nevertheless,  through  practice,  propositional  knowledge  about  proper  task 
performance  can  be  converted  to  executable  procedures  whereby  tasks  are  performed  directly  with 
sets  of  production  rules.  The  compilation  of  these  mles  and  the  gradual  refinement  of  them 
characterize  successive  substages  of  procedural  learning.  Because  EPIC  has  both  a  long-term 
memory  for  declarative  knowledge  and  a  production-system  formalism  for  procedural  knowledge,  it 
provides  natural  bases  with  which  to  characterize  various  stages  of  skill  acquisition.  Learning 
algorithms  such  as  those  proposed  by  J.  R.  Anderson  (1982)  and  others  (e.g.,  Bovair  &  Kieras,  1991; 
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Bovair,  Kieras,  &  Poison,  1990)  may  be  used  by  EPIC's  cognitive  processor,  enabling  the 
compilation  and  refinement  of  production  rules  for  performing  multiple  as  well  as  single  tasks. 

As  part  of  this  attractive  prospect,  an  important  objective  will  entail  describing  and  predicting 
how  flexible  strategies  of  task  scheduling  are  acquired  and  incorporated  into  evolving  executive 
processes.  That  such  acquisition  occurs  and  markedly  influences  eventual  performance  levels  has 
been  demonstrated  already  (e.g..  Gopher,  1993;  Lauber  et  al.,  1994;  Meyer  et  al.,  1995;  Schumacher 
et  al.,  1996,  1997).  We  know  specifically  that  the  efficacy  and  rate  of  learning  depend  on  what  types 
of  intermediate  training  are  provided.  Thus,  an  important  next  step  will  involve  specifying  the 
learning  algorithms  through  which  various  training  protocols  promote  both  optimized  temporal 
overlap  among  task  processes  and  efficient  allocation  of  limited  perceptual-motor  resources. 

Incorporation  of  energetic  mechanisms .  To  make  EPIC  computational  models  more  realistic, 
we  also  eventually  must  endow  them  with  "energetic"  mechanisms.  Drugs,  sleep  deprivation, 
emotional  arousal,  and  other  psychophysiological  variables  can  influence  rapid  human  performance 
significantly  (e.g.,  see  Beatty,  1982;  Broadbent,  1971;  Frowein,  1981).  Some  ideas  that  are  relevant 
to  how  these  influences  would  fit  within  EPIC  have  been  suggested  by  Gopher  (1986),  Sanders 
(1983),  and  others  (Hockey,  Gaillard,  &  Coles,  1986).  Different  factors  related  to  mental  and 
physical  "energetics"  might  selectively  modulate  the  estimated  magnitudes  of  EPIC  parameters  such 
as  the  perceptual  processors'  stimulus  detection  and  identification  times,  the  cognitive  processor's 
cycle  duration,  the  working  memory's  information -decay  times,  and  the  motor  processors' 
movement-production  times.  It  will  be  exciting  to  test  whether  "energetic"  effects  are  interpretable 
and  predictable  from  the  same  perspective  that  has  let  us  account  already  for  multiple-task 
performance  in  the  PRP  procedure,  HCI,  and  aircraft  cockpit  operation. 

Final  Lessons 

With  these  future  research  directions  before  us,  we  leave  behind  two  final  lessons  that  bezu* 
further  on  how  the  search  for  a  complete  veridical  UTC  should  be  viewed. 

Methodological  Lesson  13:  Psychological  science  can  (and  will)  he  fun.  As  the  intellectual 
approach  taken  by  the  Nobel  laureat  physicist,  Richard  Feynman,  has  amply  exemplified,  striving 
toward  theoretical  unification  need  not  entail  "all  work  and  no  play"  (Feynman,  1985;  Seifert, 

Meyer,  Davidson,  Patalano,  &  Yaniv,  1994).  Instead,  psychological  science  can  (and  will)  be  fun, 
because  prospective  UTCs  inspire  curiosity  and  excitement  while  leading  researchers  through 
surprising  twists  and  turns  along  the  path  of  discovery.  If  the  lessons  that  we  have  outlined  in  this 
chapter  (Tables  1  and  2)  convey  some  of  these  benefits,  then  our  principal  objective  for  now  will 
have  been  accomplished. 

Methodological  Lesson  14:  There’s  nothing  so  useful  as  a  good  theory.  Our  objective  also 
will  have  been  accomplished  insofar  as  researchers  come  to  appreciate  a  UTCs  potential  practical 
utility  more  fully.  This  is  the  keynote  on  which  we  began  the  present  chapter  and  on  which  we  end 
with  our  fourteenth  and  final  methodological  lesson.  "There's  nothing  so  useful  as  a  good  theory" 
(Gopher,  1996;  cf.  Lewin,  1951). 
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